Leukemia inhibitory factor receptors and fusion proteins

ABSTRACT

Leukemia inhibitory factor receptor (LIF-R) proteins, DNAs and expression vectors encoding LIF-R, and processes for producing LIF-R as products of recombinant cell culture, are disclosed.

CROSS-REFERENCE TO RELATED APPLICATION

This application is a divisional of U.S. application Ser. No. 07/943,843, filed Sep. 11, 1992, now U.S. Pat. No. 5,284,755, which is a continuation-in-part of U.S. application Ser. No. 07/670,608, filed Mar. 13, 1991, now abandoned, which is a continuation-in-part of U.S. application Ser. No. 07/626,725, filed Dec. 13, 1990, now abandoned.

BACKGROUND OF THE INVENTION

The present invention relates generally to cytokine receptors, and more specifically, to leukemia inhibitory factor receptors.

Leukemia inhibitory factor (LIF) is a polypeptide hormone which plays a central role in the regulation of diverse adult and embryonic systems. LIF acts on a variety of cell types and has multiple biological activities. The diversity in biological activity is reflected in the various synonyms of LIF, which include hepatocyte stimulating factor III (HSF III); Baumann and Wong, J. Immunol. 143:1163, 1989); cholinergic nerve differentiation factor (CNDF; Yamamori et al., Science 246:1412, 1990); melanoma-derived lipoprotein lipase inhibitor (MLPLI; Mori et al., Biochem. Biophys Res. Comm. 160:1085, 1989); human interleukin for DA cells (HILDA; Moreau et al., Nature 336:690, 1988); differentiation factor (D-factor; Tomida et al., J. Biol. Chem. 259:10978, 1984); differentiation inhibitory factor (DIF; Abe et at., J. Biol. Chem. 264:8941, 1989); differentiation inhibitory activity (DIA; Smith and Hooper, Devel. Biol. 121:1, 1987); and differentiation retarding factor (DRF; Koopman and Cotton, Exp. Cell. Res. 154:233, 1984).

The diversity of biological activities ascribed to LIF, whether differentiation inhibition or stimulation, proliferation or functional activation, is mediated by specific plasma membrane receptors which bind LIF. Despite the wide range of biological activities mediated by LIF, it is believed that LIF receptors (LIF-R) are highly conserved in a variety of species and expressed on a large variety of cells, since the ligand is highly conserved between species (Gough et al., Proc. Natl. Acad. Sci. USA 85:2623, 1988; Yamamori et al., Science 246:1412, 1990). LIF receptors have been identified by ligand affinity cross-linking techniques on various cell lines, including monocyte-macrophages (Hilton, et al., Proc. Natl. Acad. Sci. USA 85:5971, 1988), and also on some non-hematopoietic cells including osteoblasts, placental trophoblasts, and liver parenchymal cells (Metcalf et al., Blood 76:50, 1990). Such studies indicate that LIF-R has a molecular weight of 90 kDa (Jacques et al., 5th Symposium sur les Marqueurs de l'inflammation, Lyon 25-27 Sep. 1990, Abstract No. 37, page 122 (bioMerieux sa, Lyon, France). Characterization of LIF receptors by Scatchard analysis of binding isotherms has demonstrated that specific cell surface receptor molecules from a variety of target cells have approximately the same affinity for LIF (40-100 pM) and are present in low numbers (150 to 2,500 receptors per cell) on all cells types tested.

In order to study the structural and biological characteristics of LIF-R and the role played by LIF-R in the responses of various cell populations to LIF stimulation, or to use LIF-R effectively in therapy, diagnosis, or assay, homogeneous compositions are needed. Such compositions are theoretically available via purification of receptors expressed by cultured cells, or by cloning and expression of genes encoding the receptors. Prior to the present invention, however, several obstacles prevented these goals from being achieved.

First, although some cell lines have been identified which express LIF-R, such cell lines express LIF-R only in very low numbers (150 to 2,500 receptors/cell), which has impeded efforts to purify receptors in amounts sufficient for obtaining amino acid sequence information or generating monoclonal antibodies. The low numbers of receptors has also precluded any practical translation assay-based method of cloning.

Second, even if LIF-R protein compositions of sufficient purity could be obtained to permit N-terminal protein sequencing, the degeneracy of the genetic code may not permit one to define a suitable probe without considerable additional experimentation. Many iterative attempts may be required to define a probe having the requisite specificity to identify a hybridizing sequence in a cDNA library. Although direct expression cloning techniques avoid the need for repetitive screening using different probes of unknown specificity and have been useful in cloning other receptors (e.g., IL-1R), they have not been shown to be sufficiently sensitive to identify LIF-R clones from cDNA libraries derived from cells expressing low numbers of LIF-R.

Thus, efforts to purify the LIF-R or to clone or express genes encoding LIF-R have been impeded by lack of purified receptor or a suitable source of receptor mRNA.

SUMMARY OF THE INVENTION

The present invention provides purified leukemia inhibitory factor receptor (LIF-R) and isolated DNA sequences encoding LIF-R, e.g., human and murine LIF-R, and analogs thereof. Preferably, such isolated DNA sequences are selected from the group consisting of (a) DNA sequences comprising a nucleotide sequence derived from the coding region of a native LIF-R gene; (b) DNA sequences capable of hybridization to a DNA of (a) under moderately stringent conditions and which encode biologically active LIF-R; and (c) DNA sequences which are degenerate as a result of the genetic code to a DNA sequence defined in (a) or (b) and which encode biologically active LIF-R. Examples of the DNA sequences of (a) are cDNA clones comprising the coding region of the DNA sequence presented in SEQ ID NO:1 (human clone 65), SEQ ID NO:3 (murine clone 3), or SEQ ID NO:5 (composite full length human LIF-R sequence). Isolated DNA sequences of the present invention may comprise cDNA, PCR-amplified DNA, genomic DNA lacking introns, chemically synthesized DNA, or combinations thereof. The present invention also provides recombinant expression vectors comprising the DNA sequences defined above, recombinant LIF-R proteins produced using the recombinant expression vectors, and processes for producing the recombinant LIF-R proteins utilizing the expression vectors.

The present invention also provides substantially homogeneous preparations of LIF-R protein. LIF-R proteins have the sequence of amino acids shown, for example, in SEQ ID NO:2 and SEQ ID NO:6 (both human LIF-R) and SEQ ID NO:4 (murine LIF-R). Homodimeric forms of the LIF-R proteins are also provided.

The present invention also provides compositions for use in therapy, diagnosis, assay for LIF or LIF-R, or in raising antibodies to LIF-R, comprising effective quantities of the LIF-R proteins of the present invention.

BRIEF DESCRIPTION OF THE FIGURES

These and other aspects of the present invention will become evident upon reference to the following detailed description.

FIG. 1 presents a composite map of a human LIF-R-encoding DNA sequence, including the cleavage sites for certain restriction endonucleases. The hLIF-R open reading frame is shown boxed. The signal sequence is shown as a hatched box and the transmembrane domain is shown as a solid box. Several hLIF-R clones were isolated from cDNA and genomic libraries as described in examples 1 and 4. The horizontal lines under the composite map indicate the portion of the hLIF-R sequence that is contained in the various clones.

FIGS. 2a through 2e present a human LIF-R DNA sequence and the amino acid sequence encoded thereby, derived by sequencing cDNA and genomic clones as described in example 4. Amino acids are numbered on the left and nucleotides on the right. The signal peptide includes amino acids -44 to -1. The transmembrane domain is heavily underlined, and potential N-linked glycosylation sites are marked with asterisks. Hallmark residues associated with the hematopoietin family of receptors are shown boxed. The horizontal arrow marks the point at which genomic sequence was used to derive the 3' coding region of the hLIF-R. All cDNA clones terminated with a stretch of A nucleotides at this point.

FIG. 3 is a schematic representation of a human LIF-R homodimer. The homodimeric receptor comprises two soluble human LIF-R/Fc fusion proteins joined by disulfide bonds between the Fc moieties.

FIG. 4 shows the positions of restriction endonuclease cleavage sites in the polylinker segment and 5' end of the Fc cDNA insert in plasmid hIgG1Fc, as described in example 5.

DETAILED DESCRIPTION OF THE INVENTION

Definitions

"Leukemia inhibitory factor receptor" and "LIF-R" refer to proteins which are present on the surface of various hematopoietic cells including monocyte-macrophages and megakaryocytes, and on non-hematopoietic cells, including osteoblasts, placental trophoblasts, and liver parenchymal cells, and which are capable of binding leukemia inhibitory factor (LIF) molecules and, in their native configuration as mammalian plasma membrane proteins, play a role in transducing the signal provided by LIF to a cell. The mature full-length human LIF-R has been previously described as a protein having a molecular weight of approximately 90 kDa; however, the molecular weight of the human LIF-R protein disclosed herein, and shown in SEQ ID NO:2, is about 190,000 kDa. As used herein, the above terms include analogs or fragments of native LIF-R proteins with LIF-binding or signal transducing activity. Specifically included are truncated, soluble or fusion forms of LIF-R protein as defined below. In the absence of any species designation, LIF-R refers generically to mammalian LIF-R, which includes, but is not limited to, human, murine, and bovine LIF-R. Similarly, in the absence of any specific designation for deletion mutants, the term LIF-R means all forms of LIF-R, including mutants and analogs which possess LIF-R biological activity.

"Soluble LIF-R" or "sLIF-R" as used in the context of the present invention refer to proteins, or substantially equivalent analogs, which are substantially similar to all or part of the extracellular region of a native LIF-R, and are secreted by the cell but retain the ability to bind LIF or inhibit LIF signal transduction activity via cell surface bound LIF-R proteins. Soluble LIF-R proteins may also include pan of the transmembrane region or part of the cytoplasmic domain or other sequences, provided that the soluble LIF-R protein is capable of being secreted from the cell. Inhibition of LIF signal transduction activity can be determined using primary cells or cells lines which express an endogenous LIF-R and which are biologically responsive to LIF or which, when transfected with recombinant LIF-R DNAs, are biologically responsive to LIF. The cells are then contacted with LIF and the resulting metabolic effects examined. If an effect results which is attributable to the action of the ligand, then the recombinant receptor has signal transduction activity. Exemplary procedures for determining whether a polypeptide has signal transduction activity are disclosed by Idzerda et al., J. Exp. Med. 171:861 (1990 ); Curtis et al., Proc. Natl. Acad. Sci. USA 86:3045 (1989); Prywes et al., EMBO J. 5:2179 (1986) and Chou et al., J. Biol. Chem. 262:1842 (1987).

The term "isolated" or "purified", as used in the context of this specification to define the purity of LIF-R protein or protein compositions, means that the protein or protein composition is substantially free of other proteins of natural or endogenous origin and contains less than about 1% by mass of protein contaminants residual of production processes. Such compositions, however, can contain other proteins added as stabilizers, carriers, excipients or co-therapeutics. LIF-R is purified to substantial homogeneity if it is detectable as a single protein band in a polyacrylamide gel by silver staining.

The term "substantially similar," when used to define either amino acid or nucleic acid sequences, means that a particular subject sequence, for example, a mutant sequence, varies from a reference sequence (e.g., a native sequence) by one or more substitutions, deletions, or additions, the net effect of which is to retain biological activity of the LIF-R protein as may be determined, for example, in LIF-R binding assays, such as is described in Example 1 below. In one embodiment of the invention, such a mutant amino acid sequence is at least 90% identical, preferably at least 95% identical, to the amino acid sequence of a native LIF-R protein (e.g., the native sequence presented in SEQ ID NOS: 2, 4 or 6). In other words, at least 90% of the amino acids of a native LIF-R sequence are present, and in the same order, in the mutant sequence. For fragments of LIF-R proteins (e.g., soluble LIF-R polypeptides), the term "at least 90% identical" refers to that portion of the reference native sequence that is found in the LIF-R fragment.

Computer programs are available for determining the percent identity between two DNA or amino acid sequences (e.g., between a mutant sequence and a native sequence). One example is the GAP computer program, version 6.0, described by Devereux et al. (Nucl. Acids Res. 12:387, 1984) and available from the University of Wisconsin Genetics Computer Group (UWGCG). The GAP program utilizes the alignment method of Needleman and Wunsch (J. Mol. Biol. 48:443, 1970), as revised by Smith and Waterman (Adv. Appl. Math 2:482, 1981).

Alternatively, nucleic acid subunits and analogs are "substantially similar" to the specific native DNA sequences disclosed herein (e.g., the sequences of SEQ ID NOS: 1, 3, or 5) if the DNA sequence is capable of hybridization to a native LIF-R DNA sequence under moderately stringent conditions (50° C., 2×SSC) and encodes biologically active LIF-R protein; or the DNA sequence is degenerate as a result of the genetic code to one of the foregoing native or hybridizing DNA sequences and encodes a biologically active LIF-R protein. DNA sequences that hybridize to a native LIF-R DNA sequence under conditions of severe stringency, and which encode biologically active LIF-R, are also encompassed by the present invention. Moderate and severe stringency hybridization conditions are terms understood by the skilled artisan and have been described in, for example, Sambrook et al. Molecular Cloning: A Laboratory Manual, 2 ed. Vol. 1, pp. 1.101-104, Cold Spring Harbor Laboratory Press, (1989). LIF-R proteins encoded by the foregoing DNA sequences are provided by the present invention.

"Recombinant," as used herein, means that a protein is derived from recombinant (e.g., microbial or mammalian) expression systems. "Microbial" refers to recombinant proteins made in bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant microbial" defines a protein essentially free of native endogenous substances and unaccompanied by associated native glycosylation. Protein expressed in most bacterial cultures, e.g., E. coli, will be free of glycan; protein expressed in yeast may have a glycosylation pattern different from that expressed in mammalian cells.

"Biologically active," as used throughout the specification as a characteristic of LIF-R, means either that a particular molecule shares sufficient amino acid sequence similarity with a native LIF-R protein to be capable of binding detectable quantities of LIF, preferably at least 0.01 nmoles LIF per nanomole LIF-R, or, in the alternative, shares sufficient amino acid sequence similarity to be capable of transmitting an LIF stimulus to a cell, for example, as a component of a hybrid receptor construct. More preferably, biologically active LIF-R within the scope of the present invention is capable of binding greater than 0.1 nanomoles LIF per nanomole receptor, and most preferably, greater than 0.5 nanomoles LIF per nanomole receptor.

"DNA sequence" refers to a DNA polymer, in the form of a separate fragment or as a component of a larger DNA construct, which has been derived from DNA isolated at least once in substantially pure form, i.e., free of contaminating endogenous materials and in a quantity or concentration enabling identification, manipulation, and recovery of the sequence and its component nucleotide sequences by standard biochemical methods, for example, using a cloning vector. Such sequences are preferably provided in the form of an open reading frame uninterrupted by internal nontranslated sequences, or introns, which are typically present in eukaryotic genes. However, it will be evident that genomic DNA containing the relevant sequences could also be used. Sequences of non-translated DNA may be present 5' or 3' from the open reading frame, where the same do not interfere with manipulation or expression of the coding regions.

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. DNA sequences encoding the proteins provided by this invention may be assembled from cDNA fragments and short oligonucleotide linkers, or from a series of oligonucleotides, to provide a synthetic gene which is capable of being expressed in a recombinant transcriptional unit.

"Recombinant expression vector" refers to a plasmid comprising a transcriptional unit comprising an assembly of (1) a genetic element or elements having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural or coding sequence which is transcribed into mRNA and translated into protein, and (3) appropriate transcription and translation initiation and termination sequences. Structural elements intended for use in yeast expression systems preferably include a leader sequence enabling extracellular secretion of translated protein by a host cell. Alternatively, where recombinant protein is expressed without a leader or transport sequence, it may include an N-terminal methionine residue. This residue may optionally be subsequently cleaved from the expressed recombinant protein to provide a final product.

"Recombinant microbial expression system" means a substantially homogeneous monoculture of suitable host microorganisms, for example, bacteria such as E. coli or yeast such as S. cerevisiae, which have stably integrated a recombinant transcriptional unit into chromosomal DNA or carry the recombinant transcriptional unit as a component of a resident plasmid. Generally, cells constituting the system are the progeny of a single ancestral transformant. Recombinant expression systems as defined herein will express heterologous protein upon induction of the regulatory elements linked to the DNA sequence or synthetic gene to be expressed.

Isolation of DNA Encoding LIF-R

A human DNA sequence encoding human LIF-R was isolated from a cDNA library prepared using standard methods by reverse transcription of polyadenylated RNA isolated from human placental cells. Transfectants expressing biologically active LIF-R were initially identified using a modified slide autoradiographic technique, substantially as described by Gearing et al., EMBO J. 8:3667, 1989. Briefly, COS-7 cells were transfected with miniprep DNA in pDC303 from pools of cDNA clones directly on glass slides and cultured for 2-3 days to permit transient expression of LIF-R. The slides containing the transfected cells were then incubated with medium containing ¹²⁵ I-LIF, washed to remove unbound labeled LIF, fixed with glutaraldehyde, and dipped in liquid photographic emulsion and exposed in the dark. After developing the slides, they were individually examined with a microscope and positive cells expressing LIF-R were identified by the presence of autoradiographic silver grains against a light background.

Using this approach, approximately 240,000 cDNAs were screened in pools of approximately 2,400 cDNAs using the slide autoradiographic method until assay of one transfectant pool showed multiple cells clearly positive for LIF binding. This pool was then partitioned into pools of 600 and again screened by slide autoradiography and a positive pool was identified. This pool was further partitioned into pools of 60 and screened by plate binding assays analyzed by quantitation of bound ¹²⁵ I-LIF. The cells were scraped off and counted to determine which pool of 60 was positive. Individual colonies from this pool of 60 were screened until a single clone (clone 65) was identified which directed synthesis of a surface protein with detectable LIF binding activity. This clone was isolated, and its insert is sequenced to determine the sequence of the human LIF-R cDNA clone 65. The cloning vector pDC303 which contains the human LIF-R cDNA clone 65 was deposited with the American Type Culture Collection, Rockville, Md., USA (ATCC) on Dec. 11, 1990, under the name pHLIFR-65 (ATCC Accession No. 68491). The deposit was made under the conditions of the Budapest Treaty.

A probe may be constructed from the human sequence and used to screen various other mammalian cDNA libraries. cDNA clones which hybridize to the human probe are then isolated and sequenced.

A murine LIF-R cDNA clone was isolated by cross-species hybridization to a probe derived from human clone 65. The murine clone encoded a LIF-R protein that lacked a transmembrane region and thus was secreted rather than being retained on the cell membrane. Isolation of this murine clone is described in Example 2. Probes derived from this clone may be used in screening murine cDNA or genomic libraries to identify additional murine LIF-R clones.

A probe derived from the human clone 65 was also used in screening human cDNA and genomic libraries to identify additional human LIF-R clones. The DNA sequence presented in SEQ ID NO:5 was derived by sequencing and alignment of these human clones, as described in Example 4.

Like most mammalian genes, mammalian LIF-R is presumably encoded by multi-exon genes. Alternative mRNA constructs which can be attributed to different mRNA splicing events following transcription, and which share large regions of identity or similarity with the cDNAs claimed herein, are considered to be within the scope of the present invention.

Proteins and Analogs

The present invention provides purified mammalian LIF-R polypeptides, both recombinant and non-recombinant (the latter being purified from naturally-occurring cellular sources). Isolated LIF-R polypeptides of this invention are substantially free of other contaminating materials of natural or endogenous origin and contain less than about 1% by mass of protein contaminants residual of production processes. The LIF-R polypeptides of this invention are optionally without associated native-pattern glycosylation.

Mammalian LIF-R of the present invention includes, by way of example, primate, human, murine, canine, feline, bovine, ovine, equine, caprine and porcine LIF-R. Mammalian LIF-R can be obtained by cross species hybridization, for example using a single stranded cDNA derived from the human LIF-R DNA sequence, clone 65, as a hybridization probe to isolate LIF-R cDNAs from mammalian cDNA libraries.

Derivatives of LIF-R within the scope of the invention also include various structural forms of the primary protein which retain biological activity. Due to the presence of ionizable amino and carboxyl groups, for example, a LIF-R protein may be in the form of acidic or basic salts, or may be in neutral form. Individual amino acid residues may also be modified by oxidation or reduction.

The primary amino acid structure may be modified by forming covalent or aggregative conjugates with other chemical moieties, such as glycosyl groups, lipids, phosphate, acetyl groups and the like, or by creating amino acid sequence murals. Covalent derivatives are prepared by linking particular functional groups to LIF-R amino acid side chains or at the N- or C-termini. Other derivatives of LIF-R within the scope of this invention include covalent or aggregative conjugates of LIF-R or its fragments with other proteins or polypeptides, such as by synthesis in recombinant culture as N-terminal or C-terminal fusions. For example, the conjugated peptide may be a a signal (or leader) polypeptide sequence at the N-terminal region of the protein which co-translationally or post-translationally directs transfer of the protein from its site of synthesis to its site of function inside or outside of the cell membrane or wall (e.g., the yeast or-factor leader). LIF-R protein fusions can comprise peptides added to facilitate purification or identification of LIF-R (e.g., poly-His). The amino acid sequence of LIF-R can also be linked to the peptide Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (DYKDDDDK) (Hopp et al., Bio/Technology 6:1204,1988 and U.S. Pat. No. 5,011,912.) The latter sequence is highly antigenic and provides an epitope reversibly bound by a specific monoclonal antibody, enabling rapid assay and facile purification of expressed recombinant protein. This sequence is also specifically cleaved by bovine mucosal enterokinase at the residue immediately following the Asp-Lys pairing. Fusion proteins capped with this peptide may also be resistant to intracellular degradation in E. coli.

LIF-R derivatives may also be used as immunogens, reagents in receptor-based immunoassays, or as binding agents for affinity purification procedures of LIF or other binding ligands. LIF-R derivatives may also be obtained by cross-linking agents, such as M-maleimidobenzoyl succinimide ester and N-hydroxysuccinimide, at cysteine and lysine residues. LIF-R proteins may also be covalently bound through reactive side groups to various insoluble substrates, such as cyanogen bromide-activated, bisoxirane-activated, carbonyldiimidazole-activated or tosyl-activated agarose structures, or by adsorbing to polyolefin surfaces (with or without glutaraldehyde cross-linking). Once bound to a substrate, LIF-R may be used to selectively bind (for purposes of assay or purification) anti-LIF-R antibodies or LIF.

The LIF-R proteins of the present invention encompass proteins having amino acid sequences that vary from those of native LIF-R proteins, but that retain the ability to bind LIF or transduce a LIF-induced signal. Such variant proteins comprise one or more additions, deletions, or substitutions of amino acids when compared to a native sequence, but exhibit biological activity that is essentially equivalent to that of a native LIF-R protein. Likewise, the LIF-R-encoding DNA sequences of the present invention encompass sequences that comprise one or more additions, deletions, or substitutions of nucleotides when compared to a native LIF-R DNA sequence, but that encode a LIF-R protein that is essentially bioequivalent to a native LIF-R protein. Examples of such variant amino acid and DNA sequences (the "substantially similar" sequences discussed above) include, but are not limited to, the following.

Bioequivalent analogs of LIF-R proteins may be constructed by, for example, making various substitutions of residues or sequences or deleting terminal or internal residues or sequences not needed for biological activity. Bioequivalent analogs may be identified using the assays for biological activity that are described herein (e.g., in example 1). For example, cysteine residues not essential for biological activity can be deleted or replaced with other amino acids to prevent formation of unnecessary or incorrect intramolecular disulfide bridges upon renaturation. One or more of the cysteines that are not conserved in the hematopoietin receptor family (as indicated in FIGS. 2a through 2e) may be deleted or replaced, for example. Alternative embodiments (when the LIF-binding property of the LIF-R is desired but signal transduction is not necessary) include LIF-Rs in which the cysteines of the extracellular domain remain but cysteines of the cytoplasmic domain are deleted or replaced.

Another embodiment of the present invention involves modification of adjacent dibasic amino acid residues to enhance expression of LIF-R in yeast systems in which KEX2 protease activity is present. Site-specific mutagenesis procedures can be employed to inactivate KEX2 protease processing sites by deleting, adding, or substituting residues to alter Arg-Arg, Arg-Lys, and Lys-Arg pairs to eliminate the occurrence of these adjacent basic residues. Lys-Lys pairings are considerably less susceptible to KEX2 cleavage, and conversion of Arg-Lys or Lys-Arg to Lys-Lys represents a conservative and preferred approach to inactivating KEX2 sites. The resulting muteins are less susceptible to cleavage by the KEX2 protease at locations other than the yeast α-factor leader sequence, where cleavage upon secretion is intended. EP 212,914, is among the references disclosing the use of site-specific mutagenesis to inactivate KEX2 protease processing sites in a protein.

Review of the human LIF-R sequence of FIGS. 2a through 2e reveals Arg-Arg, Arg-Lys, or Lys-Arg doublets at amino acids -36 and -35; -27 and -26; 134 and 135; 339 and 340; 631 and 632; 816 and 817; and 817 and 818. From one to all of these KEX2 protease processing sites may be inactivated.

The present invention includes LIF-R with or without associated native-pattern glycosylation. LIF-R expressed in yeast or mammalian expression systems, e.g., COS-7 cells, may be similar or slightly different in molecular weight and glycosylation pattern than the native molecules, depending upon the expression system. Expression of LIF-R DNAs in bacteria such as E. coli provides non-glycosylated molecules. Functional mutant analogs of mammalian LIF-R having inactivated N-glycosylation sites can be produced by oligonucleotide synthesis and ligation or by site-specific mutagenesis techniques. These analog proteins can be produced in a homogeneous, reduced-carbohydrate form in good yield using yeast expression systems. N-glycosylation sites in eukaryotic proteins are characterized by the amino acid triplet Asn-A₁ -Z, where A₁ is any amino acid except Pro, and Z is Ser or Thr. In this sequence, asparagine provides a side chain amino group for covalent attachment of carbohydrate. Such sites can be eliminated by substituting another amino acid for Asn or for residue Z, deleting Asn or Z, or inserting a non-Z amino acid between A₁ and Z, or an amino acid other than Asn between Asn and A₁.

Known procedures for inactivating N-glycosylation sites in proteins include those described in U.S. Pat. No. 5,071,972 and EP 276,846. N-glycosylation sites in human LIF-R are indicated by asterisks in FIGS. 2a through 2e. From one to all of these sites may be inactivated. In one embodiment of the invention, if reduction but not elimination of glycosylation is desired, the first (i.e., N-terminal) five N-glycosylation sites of human LIF-R are inactivated. Review of the murine LIF-R amino acid sequence of SEQ ID NO:4 reveals that the murine protein lacks these five N-glycosylation sites (located in the fast hematopoietin domain), indicating that these sites may be deleted from the human protein as well without eliminating the protein's LIF-binding property.

Generally, substitutions should be made conservatively; i.e., the most preferred substitute amino acids are those having physiochemical characteristics resembling those of the residue to be replaced. Similarly, when a deletion or insertion strategy is adopted, the potential effect of the deletion or insertion on biological activity should be considered. Examples of conservative substitutions include substitution of one aliphatic residue for another, such as Ile, Val, Leu, or Ala for one another, or substitutions of one polar residue for another, such as between Lys and Arg; Glu and Asp; or Gln and Asn. Other such conservative substitutions, for example, substitutions of entire regions having similar hydrophobicity characteristics, are well known. Moreover, particular amino acid differences between human, murine and other mammalian LIF-Rs is suggestive of additional conservative substitutions that may be made without altering the essential biological characteristics of LIF-R.

Subunits (fragments) of LIF-R may be constructed by deleting terminal or internal residues or sequences. LIF-R fragments encompassed by the present invention include, but are not limited to, the following. Additional biologically active LIF-R fragments may be identified using assays such as those described in example 1. One example of an LIF-R fragment comprises amino acids 1 to 945 of SEQ ID NO: 1. As described in example 4, amino acid 945 is the last amino acid of the polypeptide encoded by clone pHLIF-R-65, before the poly-A nucleotide segment believed to result from oligo(dT) priming at an internal site in the mRNA during preparation of the hLIF-R cDNA.

LIF-binding activity resides in the extracellular domain. Thus, for applications requiring LIF-binding activity (but not the signal transducing activity conferred by the cytoplasmic domain), useful LIF-R proteins include those lacking all or part of the transmembrane region or the cytoplasmic domain of the protein. Human LIF-R fragments thus include those containing amino acids -44-x or, when the signal sequence is not desired, amino acids 1-x of the full length LIF-R sequence depicted in SEQ ID NO:5, wherein x represents an integer from 789 to 1052. Amino acid number 789 is the last amino acid of the extracellular domain (i.e., before the start of the transmembrane region). Polypeptides terminating in amino acid number 1052 lack the last C-terminal amino acid of the full length protein. The desirability of including the signal sequence depends on such factors as the position of LIF-R when it is a component of a fusion protein, and the intended host cells when the receptor is to be produced via recombinant DNA technology. Other LIF-R polypeptides may be chosen with regard to sequences that are conserved in the hematopoietin receptor family, (i.e., chosen to include the boxed sequence(s) shown in FIGS. 2a thorough 2e).

In one embodiment of the present invention, the LIF-R fragment is a soluble LIF-R polypeptide in which the transmembrane region and intracellular (cytoplasmic) domain of LIF-R are deleted or substituted with hydrophilic residues to facilitate secretion of the receptor into the cell culture medium. Soluble LIF-R proteins may also include part of the transmembrane region, provided that the soluble LIF-R protein is capable of being secreted from the cell. The resulting protein is referred to as a soluble LIF-R molecule which retains its ability to bind LIF. The present invention contemplates such soluble LIF-R constructs corresponding to all or part of the extracellular region of LIF-R. The resulting soluble LIF-R constructs are then inserted and expressed in appropriate expression vectors and assayed for the ability to bind LIF, as described in Example 1. Biologically active soluble LIF-Rs resulting from such constructions are also contemplated to be within the scope of the present invention.

Examples of soluble LIF-R proteins include, but are not limited to, the following. One soluble human LIF-R polypeptide comprises the entire extracellular domain, i.e. amino acids 1-789 of SEQ ID NO:2. Other soluble LIF-Rs are truncated upstream of the transmembrane region, but preferably include that portion of the protein that contains the residues conserved among the members of the hematopoietin receptor family (shown boxed in FIGS. 2a through 2e), i.e., amino acids 11-479 of SEQ ID NO:2. The N-terminus of such soluble LIF-Rs is any of amino acids 1-11, and the protein extends to a C-terminus selected from any of amino acids 479 through 789. Two such soluble proteins comprise amino acids 1-702 or 1-775 of SEQ ID NO:1. Constructs encoding these proteins may be prepared by techniques that involve cleaving the human LIF-R cDNA of clone 65 (example 1) with the restriction endonucleases Asp718 and Xmn1 or with Asp718 and Bsp1286I. Asp718 cleaves the vector upstream of the inserted LIF-R-encoding cDNA. Xmn1 cleaves within the codon for Asp at position 702 and Bsp1286I cleaves just 3' of the codon for Val at position 775. If desired, an oligonucleotide may be ligated to the 3' end of the Asp718/Bsp1286I fragment to extend the LIF-R sequence, e.g., through amino acid number 789.

Other soluble human LIF-Rs comprise amino acids 1-678 or 1-680. When the human and murine LIF-R amino acid sequences disclosed herein are aligned (with gaps introduced to maximize identity between the two sequences), amino acid 680 of the human sequence is aligned with the last amino acid of the murine protein, and amino acid 678 is the last amino acid of the human sequence that is identical to a corresponding amino acid in the murine sequence. Since the murine protein binds LIF, the murine LIF-R contains that portion of the protein required for LIF binding.

The murine cDNA isolated in example 2 encodes a naturally occurring soluble LIF-R protein. DNA sequences encoding soluble human LIF-R proteins may be derived from the isolated cDNA encoding membrane-bound human LIF-R (described in example 1) by conventional procedures, in view of the sequence information presented in SEQ ID NO:1. Among the procedures that may be employed to isolate and amplify a DNA fragment encoding truncated LIF-R is the well known polymerase chain reaction (PCR) procedure. See Recombinant DNA Methodology, Wu et al. eds., Academic Press Inc., San Diego (1989), pp 189-196. Alternative procedures include restriction endonuclease digestion of cloned LIF-R DNA, isolation of the desired fragment by gel electrophoresis, and subcloning of the fragment into an expression vector using conventional procedures. Oligonucleotides may be ligated to an isolated DNA fragment to regenerate the 5' or 3' terminus to a desired point in the sequence. The sequence of such oligonucleotides, as well as the primers employed in PCR, may be based upon the DNA sequence presented in SEQ ID NO: 1.

The N- or C-terminus of the LIF-R proteins of the present invention may vary according to such factors as the type of host cells employed when producing the protein via recombinant DNA technology and the particular cells from which the protein is purified when non-recombinant LIF-R is employed. Such variations may be attributable to differential post-translational processing of the protein in various types of cells, for example. Variations in the N- or C-terminal sequence also may result from the oligonucleotides chosen to reconstruct either terminus of the LIF-R encoding DNA sequence when constructing expression vectors.

Differential processing may result in mature LIF-R proteins having an N-terminal amino acid other than those shown at position 1 of SEQ ID NOS:2, 4, and 6. For example, in certain host cells, post-translational processing will remove the methionine residue encoded by an initiation codon, whereas the methionine residue will remain at the N-terminus of proteins produced in other types of host cells. Further, the N- and C-termini have been known to vary for the same protein, depending on the source of the protein. In some cases, the deletion of amino acids at either terminus of the protein may be due to proteolysis, occurring either intracellularly or during purification. Varying N-termini may also result from cleavage of the signal peptide in certain host cells at a point other than between amino acids -1 and 1 of the disclosed sequences.

The LIF-R proteins of the present invention thus include proteins having termini that vary from those shown in SEQ ID NOS:2 and 6 (human) or 4 (murine). The N-terminal amino acid of the mature protein may, for example, be any of the amino acids at positions 1 to 5 of SEQ ID NOS:2, 4, or 6. The C-terminus may be truncated deliberately during expression vector construction (e.g., in constructing vectors encoding soluble proteins as described above) or as a result of differential processing which may remove up to about five C-terminal amino acids, for example.

Mutations in nucleotide sequences constructed for expression of the above-described variant or analog LIF-R proteins should, of course, preserve the reading frame phase of the coding sequences and preferably will not create complementary regions that could hybridize to produce secondary mRNA structures such as loops or hairpins which would adversely affect translation of the receptor mRNA. Although a mutation site may be predetermined, it is not necessary that the nature of the mutation per se be predetermined. For example, in order to select for optimum characteristics of mutants at a given site, random mutagenesis may be conducted at the target codon and the expressed LIF-R mutants screened for the desired activity.

Not all mutations in the nucleotide sequence which encodes LIF-R will be expressed in the final product. For example, nucleotide substitutions may be made to enhance expression, primarily to avoid secondary structure loops in the transcribed mRNA (see EPA 75,444A, incorporated herein by reference), or to provide codons that are more readily translated by the selected host, e.g., the well-known E. coli preference codons for E. coli expression (see U.S. Pat. No. 4,425,437, column 6). The known degeneracy of the genetic code permits variation of a DNA sequence without altering the amino acid sequence, since a given amino acid may be encoded by more than one codon.

Mutations can be introduced at particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked by restriction sites enabling ligation to fragments of the native sequence. Following ligation, the resulting reconstructed sequence encodes an analog having the desired amino acid insertion, substitution, or deletion.

Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can be employed to provide an altered gene having particular codons altered according to the substitution, deletion, or insertion required. Exemplary methods of making the alterations set forth above are disclosed by Walder et al. (Gene 42:133, 1986); Bauer et al. (Gene 37:73, 1985); Craik (BioTechniques, January 1985, 12-19); Smith et al. (Genetic Engineering: Principles and Methods, Plenum Press, 1981); and U.S. Pat. Nos. 4,518,584 and 4,737,462 disclose suitable techniques, and are incorporated by reference herein.

The LIF-R proteins of the present invention encompass proteins encoded by (a) a DNA sequence derived from the coding region of a native LIF-R gene or (b) a DNA sequence capable of hybridization to a native LIF-R DNA of (a) under moderately stringent conditions and which encodes biologically active LIF-R. LIF-R proteins encoded by a DNA molecule that varies from the DNA sequences of SEQ ID NOS: 1, 3, and 5, wherein one strand of the DNA molecule will hybridize to the DNA sequence presented in SEQ ID NOS: 1, 3, or 5, include, but are not limited to, LIF-R fragments (soluble or membrane-bound) and LIF-R proteins comprising inactivated N-glycosylation site(s), inactivated KEX2 protease processing site(s), and/or conservative amino acid substitution(s), as described above. LIF-R proteins encoded by DNA derived from other mammalian species, wherein the DNA will hybridize to the human or murine DNA of SEQ ID NOS: 1, 3, or 5, are also encompassed.

Both monovalent forms and polyvalent forms of LIF-R are useful in the compositions and methods of this invention. Polyvalent forms possess multiple LIF-R binding sites for LIF ligand. For example, a bivalent soluble LIF-R may consist of two tandem repeats of the extracellular region of LIF-R, separated by a linker region. Two LIF-R polypeptides (each capable of binding LIF) may be joined via any suitable means, e.g., using one of the commercially available cross-linking reagents used to attach one polypeptide to another (Pierce Chemical Co., Rockford, Ill.). Alternatively, a fusion protein comprising multiple LIF-R polypeptides joined via peptide linkers may be produced using recombinant DNA technology. Suitable peptide linkers comprise a chain of amino acids, preferably from 20 to 100 amino acids in length. The linker advantageously comprises amino acids selected from the group consisting of glycine, asparagine, serine, threonine, and alanine. Examples of suitable peptide linkers include, but are not limited to, (Gly₄ Ser)_(n), wherein n is 4-12, and (Gly₄ SerGly₅ Ser)₂. The use of such peptide linkers is illustrated in U.S. Pat. No. 5,073,627, for example.

A DNA sequence encoding a desired peptide linker may be inserted between, and in the same reading frame as, two DNA sequences encoding LIF-R using any suitable conventional technique. For example, a chemically synthesized oligonucleotide encoding the linker and containing appropriate restriction endonuclease cleavage sites may be ligated between two LIF-R encoding sequences. The resulting gene fusion is inserted into an expression vector for production of the fusion protein in the desired host cells.

Alternate polyvalent forms may also be constructed, for example, by chemically coupling LIF-R to any clinically acceptable carrier molecule, a polymer selected from the group consisting of Ficoll, polyethylene glycol or dextran using conventional coupling techniques. Alternatively, LIF-R may be chemically coupled to biotin, and the biotin-LIF-R conjugate then allowed to bind to avidin, resulting in tetravalent avidin/biotin/LIF-R molecules. LIF-R may also be covalently coupled to dinitrophenol (DNP) or trinitrophenol (TNP) and the resulting conjugate precipitated with anti-DNP or anti-TNP-IgM, to form decameric conjugates with a valency of 10 for LIF-R binding sites.

A recombinant chimeric antibody molecule may also be produced having LIF-R sequences substituted for the variable domains of either or both of the immunoglobulin molecule heavy and light chains and having unmodified constant region domains. For example, chimeric LIF-R/IgG₁ may be produced from two chimeric genes--a LIF-R/human k light chain chimera (LIF-R/C_(k)) and a LIF-R/human g1 heavy chain chimera (LIF-R/C_(g-1)). Following transcription and translation of the two chimeric genes, the gene products assemble into a single chimeric antibody molecule having LIF-R displayed bivalently. Assembly of two sets of the two chimeric proteins results in a molecule comprising two LIF-R/light chain fusions and two LIF-R/heavy chain fusions. LIF-R is displayed tetravalently. Assembly occurs when disulfide bonds form between the polypeptide chains, as occurs in native antibodies. Such polyvalent forms of LIF-R may have enhanced binding affinity for LIF ligand. Additional details relating to the construction of such chimeric antibody molecules are disclosed in WO 89/09622 and EP 315062.

Alternatively, a LIF-R DNA sequence may be fused to a DNA sequence encoding an antibody Fc region polypeptide. Dimeric forms of LIF-R include homodimers comprising two LIF-R/Fc fusion proteins joined by disulfide bonds between the Fc moieties. Such homodimers preferably comprise one of the soluble LIF-R polypeptides described above, with an antibody Fc region polypeptide attached to the C-terminus of the LIF-R polypeptide. The LIF-R/Fc fusion proteins optionally comprise a peptide linker (described above) positioned between the LIF-R polypeptide and the antibody Fc polypeptide. One peptide linker is described in example 5.

By "antibody Fc region polypeptides" is meant polypeptides corresponding to the Fc region of an antibody, or fragments thereof comprising sufficient cysteine residues so that disulfide bonds will form between two Fc polypeptides. N-terminal fragments of an antibody Fc region that contain the cysteine residues involved in disulfide bond formation at the hinge region may be employed. Examples include the Fc polypeptide described in example 5 and fragments thereof comprising at least the first three cysteine residues (hinge region). Procedures for isolating the Fc region of an antibody are well-known and include proteolytic digestion with papain. Alternatively, an Fc polypeptide may be produced by recombinant cells or chemically synthesized.

In one embodiment, the present invention provides an isolated DNA sequence encoding a soluble fusion protein comprising an N-terminal signal peptide followed by a human LIF-R polypeptide (derived from the extracellular domain), which is followed by an antibody Fc polypeptide. The signal sequence may be the native human LIF-R signal peptide (amino acids -44 to -1 of SEQ ID NO: 1) or a heterologous signal peptide, chosen according to the host cells to be employed. Heterologous signal peptides include the yeast α factor leader peptide described below and the IL-7 leader peptide described in U.S. Pat. No. 4,965,195, for example.

One example of a dimeric receptor comprising two LIF-R/Fc polypeptides is illustrated in example 5 below. The receptor is depicted in FIG. 3. The number and position of disulfide bonds may vary from those shown in FIG. 3.

Expression Of Recombinant LIF-R

The present invention provides recombinant expression vectors to amplify or express DNA encoding LIF-R. Recombinant expression vectors are replicable DNA constructs which have synthetic or cDNA-derived DNA fragments encoding mammalian LIF-R or bioequivalent analogs operably linked to suitable transcriptional or translational regulatory elements derived from mammalian, microbial, vital or insect genes. A transcriptional unit generally comprises an assembly of (1) a genetic element or elements having a regulatory role in gene expression, for example, transcriptional promoters or enhancers, (2) a structural or coding sequence which is transcribed into mRNA and translated into protein, and (3) appropriate transcription and translation initiation and termination sequences, as described in detail below. Such regulatory elements may include an operator sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding sites. The ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants may additionally be incorporated. DNA regions are operably linked when they are functionally related to each other. For example, DNA for a signal peptide (secretory leader) is operably linked to DNA for a polypeptide if it is expressed as a precursor which participates in the secretion of the polypeptide; a promoter is operably linked to a coding sequence if it controls the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit translation. Generally, operably linked means contiguous and, in the case of secretory leaders, contiguous and in reading frame. Structural elements intended for use in yeast expression systems preferably include a leader sequence enabling extracellular secretion of translated protein by a host cell. Alternatively, where recombinant protein is expressed without a leader or transport sequence, it may include an N-terminal methionine residue. This residue may optionally be subsequently cleaved from the expressed recombinant protein to provide a final product.

DNA sequences encoding mammalian LIF-Rs which are to be expressed in a microorganism will preferably contain no introns that could prematurely terminate transcription of DNA into mRNA; however, premature termination of transcription may be desirable, for example, where it would result in mutants having advantageous C-terminal truncations, for example, deletion of a transmembrane region to yield a soluble receptor not bound to the cell membrane. Due to code degeneracy, there can be considerable variation in nucleotide sequences encoding the same amino acid sequence. Other embodiments include sequences capable of hybridizing to clone 65 under moderately stringent conditions (50° C., 2×SSC) and other sequences hybridizing or degenerate to those which encode biologically active LIF-R polypeptides.

Recombinant LIF-R DNA is expressed or amplified in a recombinant expression system comprising a substantially homogeneous monoculture of suitable host microorganisms, for example, bacteria such as E. coli or yeast such as S. cerevisiae, which have stably integrated (by transformation or transfection) a recombinant transcriptional unit into chromosomal DNA or carry the recombinant transcriptional unit as a component of a resident plasmid. Mammalian host cells are preferred for expressing recombinant LIF-R. Generally, cells constituting the system are the progeny of a single ancestral transformant. Recombinant expression systems as defined herein will express heterologous protein upon induction of the regulatory elements linked to the DNA sequence or synthetic gene to be expressed.

Transformed host cells are cells which have been transformed or transfected with LIF-R vectors constructed using recombinant DNA techniques. Transformed host cells ordinarily express LIF-R, but host cells transformed for purposes of cloning or amplifying LIF-R DNA do not need to express LIF-R. Expressed LIF-R will be deposited in the cell membrane or secreted into the culture supernatant, depending on the LIF-R DNA selected. Suitable host cells for expression of mammalian LIF-R include prokaryotes, yeast or higher eukaryotic cells under the control of appropriate promoters. Prokaryotes include gram negative or gram positive organisms, for example E. coli or bacilli. Higher eukaryotic cells include established cell lines of mammalian origin as described below. Cell-free translation systems could also be employed to produce mammalian LIF-R using RNAs derived from the DNA constructs of the present invention. Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and mammalian cellular hosts are described by Pouwels et al. (Cloning Vectors: A Laboratory Manual, Elsevier, New York, 1985), the relevant disclosure of which is hereby incorporated by reference.

Prokaryotic expression hosts may be used for expression of LIF-R that do not require extensive proteolytic and disulfide processing. Prokaryotic expression vectors generally comprise one or more phenotypic selectable markers, for example a gene encoding proteins conferring antibiotic resistance or supplying an autotrophic requirement, and an origin of replication recognized by the host to ensure amplification within the host. Suitable prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimurium, and various species within the genera Pseudomonas, Streptomyces, and Staphyolococcus, although others may also be employed as a matter of choice.

Useful expression vectors for bacterial use can comprise a selectable marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 and pGEX (Pharmacia Fine Chemicals, Uppsala, Sweden) and pGEM1 (Promega Biotec, Madison, Wis., USA). These pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence to be expressed. E. coli is typically transformed using derivatives of pBR322, a plasmid derived from an E. coli species (Bolivar et al., Gene 2:95, 1977). pBR322 contains genes for ampicillin and tetracycline resistance and thus provides simple means for identifying transformed cells.

Promoters commonly used in recombinant microbial expression vectors include the b-lactamase (penicillinase) and lactose promoter system (Chang et al., Nature 275:615, 1978; and Goeddel et al., Nature 281:544, 1979), the tryptophan (trp) promoter system (Goeddel et al., Nucl. Acids Res. 8:4057, 1980; and EPA 36,776) and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, p. 412, 1982). A particularly useful bacterial expression system employs the phage 1 PL promoter and cI8571ts thermolabile repressor. Plasmid vectors available from the American Type Culture Collection which incorporate derivatives of the 1P_(L) promoter include plasmid pHUB2, resident in E. coli strain JMB9 (ATCC 37092) and pPLc28, resident in E. coli RR1 (ATCC 53082).

Recombinant LIF-R proteins may also be expressed in yeast hosts, preferably from the Saccharomyces species, such as S. cerevisiae. Yeast of other genera, such as Pichia or Kluyveromyces may also be employed. Yeast vectors will generally contain an origin of replication from the 2m yeast plasmid or an autonomously replicating sequence (ARS), promoter, DNA encoding LIF-R, sequences for polyadenylation and transcription termination and a selection gene. Preferably, yeast vectors will include an origin of replication and selectable marker permitting transformation of both yeast and E. coli, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 or URA3 gene, which provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, and a promoter derived from a highly expressed yeast gene to induce transcription of a structural sequence downstream. The presence of the TRP1 or URA3 lesion in the yeast host cell genome then provides an effective environment for detecting transformation by growth in the absence of tryptophan or uracil.

Suitable promoter sequences in yeast vectors include the promoters for metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255:2073, 1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149, 1968; and Holland et al., Biochem. 17:4900, 1978), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. Suitable vectors and promoters for use in yeast expression are further described in R. Hitzeman et al., EPA 73,657.

Preferred yeast vectors can be assembled using DNA sequences from pUC18 for selection and replication in E. coli (Amp^(r) gene and origin of replication) and yeast DNA sequences including a glucose-repressible ADH2 promoter and α-factor secretion leader. The ADH2 promoter has been described by Russell et al. (J. Biol. Chem. 258:2674, 1982) and Beier et al. (Nature 300:724, 1982). The yeast α-factor leader, which directs secretion of heterologous proteins, can be inserted between the promoter and the structural gene to be expressed. See, e.g., Kurjan et al., Cell 30:933, 1982; and Bitter et al., Proc. Natl. Acad. Sci. USA 8:5330, 1984. The leader sequence may be modified to contain, near its 3' end, one or more useful restriction sites to facilitate fusion of the leader sequence to foreign genes.

Suitable yeast transformation protocols are known to those of skill in the art; an exemplary technique is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 75:1929, 1978, selecting for Trp⁺ transformants in a selective medium consisting of 0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, 10 μg/ml adenine and 20 μg/ml uracil or URA+ transformants in medium consisting of 0.67% YNB, with amino acids and bases as described by Sherman et al., Laboratory Course Manual for Methods in Yeast Genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1986.

Host strains transformed by vectors comprising the ADH2 promoter may be grown for expression in a rich medium consisting of 1% yeast extract, 2% peptone, and 1% or 4% glucose supplemented with 80 μg/ml adenine and 80 μg/ml uracil. Derepression of the ADH2 promoter occurs upon exhaustion of medium glucose. Crude yeast supernatants are harvested by filtration and held at 4° C. prior to further purification.

Various mammalian or insect cell culture systems are also advantageously employed to express recombinant protein. Expression of recombinant proteins in mammalian cells is particularly preferred because such proteins are generally correctly folded, appropriately modified and completely functional. Examples of suitable mammalian host cell fines include the COS-7 lines of monkey kidney cells, described by Gluzman (Cell 23:175, 1981), and other cell lines capable of expressing a heterologous gene in an appropriate vector including, for example, L cells, C127, 3T3, Chinese hamster ovary (CHO), HeLa and BHK cell lines. Mammalian expression vectors may comprise nontranscribed elements such as an origin of replication, a suitable promoter and enhancer linked to the gene to be expressed, and other 5' or 3' flanking nontranscribed sequences, and 5' or 3' nontranslated sequences, such as necessary ribosome binding sites, a polyadenylation site, splice donor and acceptor sites, and transcriptional termination sequences. Baculovirus systems for production of heterologous proteins in insect cells are reviewed by Luckow and Summers, Bio/Technology 6:47 (1988).

The transcriptional and translational control sequences in expression vectors to be used in transforming vertebrate cells may be provided by vital sources. For example, commonly used promoters and enhancers are derived from Polyoma, Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. DNA sequences derived from the SV40 viral genome, for example, SV40 origin, early and late promoter, enhancer, splice, and polyadenylation sites may be used to provide the other genetic elements required for expression of a heterologous DNA sequence. The early and late promoters are particularly useful because both are obtained easily from the virus as a fragment which also contains the SV40 viral origin of replication (Fiers et al., Nature 273:113, 1978). Smaller or larger SV40 fragments may also be used, provided the approximately 250 bp sequence extending from the Hind III site toward the Bgl1 site located in the viral origin of replication is included. Further, mammalian genomic LIF-R promoter, control and/or signal sequences may be utilized, provided such control sequences are compatible with the host cell chosen. Additional details regarding the use of a mammalian high expression vector to produce a recombinant mammalian LIF-R are provided in Example 1 below. Exemplary vectors can be constructed as disclosed by Okayama and Berg (Mol. Cell. Biol. 3:280, 1983).

A useful system for stable high level expression of mammalian receptor cDNAs in C127 murine mammary epithelial cells can be constructed substantially as described by Cosman et al. (Mol. Immunol. 23:935, 1986).

In preferred aspects of the present invention, recombinant expression vectors comprising LIF-R cDNAs are stably integrated into a host cell's DNA. Elevated levels of expression product is achieved by selecting for cell lines having amplified numbers of vector DNA. Cell lines having amplified numbers of vector DNA are selected, for example, by transforming a host cell with a vector comprising a DNA sequence which encodes an enzyme which is inhibited by a known drug. The vector may also comprise a DNA sequence which encodes a desired protein. Alternatively, the host cell may be co-transformed with a second vector which comprises the DNA sequence which encodes the desired protein. The transformed or co-transformed host cells are then cultured in increasing concentrations of the known drug, thereby selecting for drug-resistant cells. Such drug-resistant cells survive in increased concentrations of the toxic drug by over-production of the enzyme which is inhibited by the drug, frequently as a result of amplification of the gene encoding the enzyme. Where drug resistance is caused by an increase in the copy number of the vector DNA encoding the inhibitable enzyme, there is a concomitant co-amplification of the vector DNA encoding the desired protein (e.g., LIF-R) in the host cell's DNA.

A preferred system for such co-amplification uses the gene for dihydrofolate reductase (DHFR), which can be inhibited by the drug methotrexate (MTX). To achieve co-amplification, a host cell which lacks an active gene encoding DHFR is either transformed with a vector which comprises DNA sequence encoding DHFR and a desired protein, or is co-transformed with a vector comprising a DNA sequence encoding DHFR and a vector comprising a DNA sequence encoding the desired protein. The transformed or co-transformed host cells are cultured in media containing increasing levels of MTX, and those cells lines which survive are selected.

A particularly preferred co-amplification system uses the gene for glutamine synthetase (GS), which is responsible for the synthesis of glutamine from glutamate and ammonia using the hydrolysis of ATP to ADP and phosphate to drive the reaction. GS is subject to inhibition by a variety of inhibitors, for example methionine sulphoximine (MSX). Thus, LIF-R can be expressed in high concentrations by co-amplifying cells transformed with a vector comprising the DNA sequence for GS and a desired protein, or co-transformed with a vector comprising a DNA sequence encoding GS and a vector comprising a DNA sequence encoding the desired protein, culturing the host cells in media containing increasing levels of MSX and selecting for surviving cells. The GS co-amplification system, appropriate recombinant expression vectors and cells lines, are described in the following PCT applications: WO 87/04462, WO 89/01036, WO 89/10404 and WO 86/05807.

Recombinant proteins are preferably expressed by co-amplification of DHFR or GS in a mammalian host cell, such as Chinese Hamster Ovary (CHO) cells, or alternatively in a murine myeloma cell line, such as SP2/0-Ag14 or NS0 or a rat myeloma cell line, such as YB2/3.0-Ag20, disclosed in PCT applications WO/89/10404 and WO 86/05807.

Vectors derived from retroviruses may be employed in mammalian host cells. A preferred retroviral expression vector is tgLS(+) HyTK, described in PCT application WO 92/08796.

A preferred eukaryotic vector for expression of LIF-R DNA is disclosed below in Example 1. This vector, referred to as pDC303, was derived from the mammalian high expression vector pDC201 and contains regulatory sequences from SV40, CMV and adenovirus.

In an especially preferred expression system, a LIF-R-encoding DNA sequence is inserted into the mammalian expression vector pCAV-DHFR. The resulting recombinant expression vector is transfected into a DHFR⁻ Chinese hamster ovary cell line, e.g., CHO-DXB11. pCAV-DHFR is an expression vector containing SV40 promoter sequences upstream of a multiple cloning site and a dihydrofolate reductase (DHFR) gene as a selectable marker. The DHFR gene confers a selective advantage on otherwise DHFR⁻ mammalian cells that have taken up the vector, when grown in the presence of methotrexate (MTX).

pCAV/DHFR was prepared by inserting a DHFR gene into the plasmid vector known as pCAV/NOT (described in PCT application WO 90/05183). pCAV/NOT was assembled from pDC201 (a derivative of pMLSV, previously described by Cosman et al., Nature 312:768, 1984), SV40 and cytomegalovirus DNA and comprises, in sequence with the direction of transcription from the origin of replication: (1) SV40 sequences from coordinates 5171-270 including the origin of replication, enhancer sequences and early and late promoters; (2) cytomegalovirus sequences including the promoter and enhancer regions (nucleotides 67 1 to +63 from the sequence published by Boechart et at. (Cell 41:521, 1985); (3) adenovirus-2 sequences containing the first exon and part of the intron between the first and second exons of the tripartite leader, the second exon and part of the third exon of the tripartite leader and a multiple cloning site (MCS) containing sites for Xho1, Kpn1, Sma1, Not1 and Bgl1; (4) SV40 sequences from coordinates 4127-4100 and 2770-2533 that include the polyadenylation and termination signals for early transcription; (5) sequences derived from pBR322 and virus-associated sequences VAI and VAII of pDC201, with adenovirus sequences 10532-11156 containing the VAI and VAII genes, followed by pBR322 sequences from 4363-2486 and 1094-375 containing the ampicillin resistance gene and origin of replication.

Purification of Recombinant LIF-R

Purified recombinant mammalian LIF-Rs or analogs are prepared by culturing suitable host/vector systems to express the recombinant translation products of the DNAs of the present invention, which are then purified from culture media or cell extracts.

For example, supernatants from systems which secrete recombinant soluble LIF-R protein into culture media can be first concentrated using a commercially available protein concentration filter, for example, an Amicon or Millipore Pellicon ultrafiltration unit. Following the concentration step, the concentrate can be applied to a suitable purification matrix. For example, a suitable affinity matrix can comprise an LIF or lectin or antibody molecule bound to a suitable support. Alternatively, an anion exchange resin can be employed, for example, a matrix or substrate having pendant diethylaminoethyl (DEAE) groups. The matrices can be acrylamide, agarose, dextran, cellulose or other types commonly employed in protein purification. Alternatively, a cation exchange step can be employed. Suitable cation exchangers include various insoluble matrices comprising sulfopropyl or carboxymethyl groups. Sulfopropyl groups are preferred.

Finally, one or more reversed-phase high performance liquid chromatography (RP-HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other aliphatic groups, can be employed to further purify a LIF-R composition. Some or all of the foregoing purification steps, in various combinations, can also be employed to provide a homogeneous recombinant protein.

Recombinant protein produced in bacterial culture is usually isolated by initial extraction from cell pellets, followed by one or more concentration, salting-out, aqueous ion exchange or size exclusion chromatography steps. Finally, high performance liquid chromatography (HPLC) can be employed for final purification steps. Microbial cells employed in expression of recombinant mammalian LIF-R can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents.

Fermentation of yeast which express soluble mammalian LIF-R as a secreted protein greatly simplifies purification. Secreted recombinant protein resulting from a large-scale fermentation can be purified by methods analogous to those disclosed by Urdal et al. (J. Chromatog. 296:171, 1984). This reference describes two sequential, reversed-phase HPLC steps for purification of recombinant human GM-CSF on a preparative HPLC column.

Human LIF-R synthesized in recombinant culture is characterized by the presence of non-human cell components, including proteins, in amounts and of a character which depend upon the purification steps taken to recover human LIF-R from the culture. These components ordinarily will be of yeast, prokaryotic or non-human higher eukaryotic origin and preferably are present in innocuous contaminant quantities, on the order of less than about 1 percent by weight. Further, recombinant cell culture enables the production of LIF-R free of proteins which may be normally associated with LIF-R as it is found in nature in its species of origin, e.g. in cells, cell exudates or body fluids.

Uses of LIF-R Proteins and Compositions Comprising LIF-R

The LIF-R proteins disclosed herein find use as research reagents, as diagnostic reagents in in vitro assays, and in in vivo therapeutic procedures. Pharmaceutical compositions comprising an effective amount of LIF-R and a suitable diluent or carrier are provided by the present invention.

Cells expressing a membrane-bound recombinant LIF-R protein may be employed in studies of signal transduction; in various assays to detect LIF binding to the cells; or to analyze the ability of a particular protein (e.g., a soluble LIF-R) to compete with the membrane-bound LIF-R for binding of LIF. Labeled (e.g., radiolabeled) LIF-R may be used to assay a biological sample for LIF. Soluble LIF-R proteins are preferred for therapeutic use.

The present invention provides methods of using therapeutic compositions comprising a therapeutically effective amount of soluble LIF-R proteins and a suitable diluent and carrier, and methods for suppressing LIF-dependent biological responses in humans comprising administering an effective amount of soluble LIF-R protein. The therapeutically effective amount is an amount effective in ameliorating a LIF-mediated condition, and will vary according to the nature of the condition, the route of administration, the size of the patient, etc.

For therapeutic use, purified soluble LIF-R protein is administered to a patient, preferably a human, for treatment in a manner appropriate to the indication. Thus, for example, soluble LIF-R protein compositions can be administered by bolus injection, continuous infusion, sustained release from implants, or other suitable technique. Typically, a soluble LIF-R therapeutic agent will be administered in the form of a composition comprising purified protein in conjunction with physiologically acceptable carriers, excipients or diluents. Such carriers will be nontoxic to recipients at the dosages and concentrations employed. Ordinarily, the preparation of such compositions entails combining the LIF-R with buffers, antioxidants such as ascorbic acid, low molecular weight (less than about 10 residues) polypeptides, proteins, amino acids, carbohydrates including glucose, sucrose or dextrins, chelating agents such as EDTA, glutathione and other stabilizers and excipients. Neutral buffered saline or saline mixed with conspecific serum albumin are exemplary appropriate diluents. Preferably, product is formulated as a lyophilizate using appropriate excipient solutions (e.g., sucrose) as diluents. Appropriate dosages can be determined in trials.

Because LIF-R proteins bind to LIF, soluble LIF-R proteins can be used to competitively bind to LIF and thereby inhibit binding of LIF to cell surface LIF-R proteins. Soluble LIF-R is therefore expected to inhibit LIF-dependent biological activities. Soluble LIF-R may, for example, be useful in therapy to inhibit the effects of LIF induced cachexia in cancer patients or to treat lipoprotein metabolism defects such as atherosclerosis and obesity. Soluble LIF-R may also be useful in the treatment of disorders of bone and calcium metabolism or disorders associated with LIF overproduction associated with hepatocytes, neurons, and leukocytes. The regulation of embryonic and hematopoietic stem cells by LIF may also be manipulated with soluble LIF-R. Soluble LIF-R may also be used to treat leukemic cells which respond to LIF by proliferating.

LIF-R or antibodies to LIF-R may also be useful as a diagnostic reagent to detect diseases characterized by the presence of abnormal LIF-R.

Sense and Antisense Sequences

The present invention provides both double-stranded and single-stranded LIF-R DNA, and LIF-R mRNA as well. The single-stranded LIF-R nucleic acids have use as probes to detect the presence of hybridizing LIF-R nucleic acids (e.g., in in vitro assays) and as sense and antisense molecules to block expression of LIF-R.

In one embodiment, the present invention provides antisense or sense molecules comprising a single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target LIF-R mRNA (sense) or LIF-R DNA (antisense) sequences. These antisense or sense molecules may comprise a fragment of the coding region of LIF-R cDNA, and, in one embodiment, are oligonucleotides comprising at least about 14 nucleotides, preferably from about 14 to about 30 nucleotides, of a LIF-R cDNA sequence. The ability to create an antisense or sense oligonucleotide based upon a cDNA sequence for a given protein is described in, for example, Stein and Cohen, Cancer Res. 48:2659, 1988 and van der Krol et al., BioTechniques 6:958, 1988, which are hereby incorporated by reference.

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation of duplexes that block translation (RNA) or transcription (DNA) by one of several means, including enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. The oligonucleotides thus may be used to block expression of LIF-R proteins. Uses of the antisense and sense nucleic acid sequences include, but are not limited to, use as research reagents. The biological effects of blocking LIF-R expression in cultured cells may be studied, for example. The oligonucleotides also may be employed in developing therapeutic procedures that involve blocking LIF-R expression in vivo.

Antisense or sense oligonucleotides further comprise oligonucleotides having modified sugar-phosphodiester backbones (or other sugar linkages, such as those described in WO91/06629) and wherein such sugar linkages are resistant to endogenous nucleases. Such oligonucleotides with resistant sugar linkages are relatively stable in vivo (i.e., capable of resisting enzymatic degradation) but retain sequence specificity for binding to target nucleotide sequences. Other examples of sense or antisense oligonucleotides include those oligonucleotides which are covalently linked to organic moieties such as those described in WO 90/10448, or to other moieties that increase affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target nucleotide sequence.

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid sequence by any suitable method, including, for example, CaPO₄ -mediated DNA transfection, electroporation, or by using gene transfer vectors such as Epstein-Barr virus. A preferred method involves insertion of the antisense or sense oligonucleotide into a suitable retroviral vector, then contacting the target cell with the retrovirus vector containing the inserted sequence, either in vivo or ex vivo. Suitable retroviral vectors include, but are not limited to, the murine retrovirus M-MuLV, N2 (a retrovirus derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see PCT Application US 90/02656).

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide sequence by attaching the oligonucleotide to a molecule that binds to the target cell, as described in WO 91/04753. The oligonucleotide may be attached to molecules that include, but are not limited to, antibodies, growth factors, other cytokines, or other ligands that bind to cell surface receptors.

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase.

The following examples are offered by way of illustration, and not by way of limitation.

EXAMPLES Example 1 Isolation and Expression of cDNAs Encoding Human LIF-R

A. Radiolabeling of LIF. Recombinant human LIF was expressed in yeast and purified to homogeneity essentially as described by Hopp, et al., Bio/Technology 6:1204, 1988. The purified protein was radiolabeled using a commercially available enzymobead radioiodination reagent (BioRad). In this procedure 10 μg rLIF in 50 μl 0.2M sodium phosphate, pH 7.2, are combined with 50 μl enzymobead reagent, 2 MCi of sodium iodide in 20 μl of 0.05M sodium phosphate pH 7.0 and 10 μl of 2.5% b-D-glucose. After 10 min at 25° C., sodium azide (20 μl of 50 mM) and sodium metabisulfite (10 μl of 5 mg/ml) were added and incubation continued for 5 min. at 25° C. The reaction mixture was fractionated by gel filtration on a 2 ml bed volume of Sephadex® G-25 (Sigma) equilibrated in Roswell Park Memorial Institute (RPMI) 1640 medium containing 2.5% (w/v) bovine serum albumin (BSA), 0.2% (w/v) sodium azide and 20 mM Hepes pH 7.4 (binding medium). The final pool of ¹²⁵ I-LIF was diluted to a working stock solution of 3×10⁻⁸ M in binding medium and stored for up to one month at 4° C. without detectable loss of receptor binding activity. The specific activity is routinely in the range of 6-8×10¹⁵ cpm/mmole LIF.

B. Membrane Binding Assays. Human placental membranes were incubated at 4° C. for 2 hr with ¹²⁵ I-LIF in binding medium, 0.1% bacitracin, 0.02% aprotinin, and 0.4% BSA in a total volume of 1.2 ml. Control tubes containing in addition a 100-fold molar excess of unlabeled LIF were also included to determine non-specific binding. The reaction mixture was then centrifuged at 15,000×g in a microfuge for 5 minutes. Supernatants were discarded, the surface of the membrane pellets carefully rinsed with ice-cold binding medium, and the radioactivity counted on a gamma counter. Using this assay, it was determined that the LIF-R was present on placental membranes, and up to 96 fmols of ¹²⁵ I-LIF could be bound per mg of placental membrane protein.

C. Construction and Screening of Placental cDNA Library. A tissue source for LIF-R was selected by screening various human cell lines and tissues for expression of LIF-R based on their ability to bind ¹²⁵ I-labeled LIF, prepared as described above in Example 1A. An unsized cDNA library was constructed by reverse transcription of polyadenylated mRNA isolated from total RNA extracted from the human placental tissue (Ausubel et al., eds., Current Protocols in Molecular Biology, Vol. 1, 1987). The cells were harvested by lysing the tissue cells in a guanidinium isothiocyanate solution and total RNA was isolated using standard techniques as described by Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 1982.

Polyadenylated RNA was isolated by oligo dT cellulose chromatography and double-stranded cDNA was prepared by a method similar to that of Gubler and Hoffman, Gene 25:263, 1983. Briefly, the polyadenylated RNA was converted to an RNA-cDNA hybrid with reverse transcriptase using oligo dT as a primer. The RNA-cDNA hybrid was then converted into double-stranded cDNA using RNAase H in combination with DNA polymerase I. The resulting double stranded cDNA was blunt-ended with T4 DNA polymerase. BglII adaptors were ligated to the 5' ends of the resulting blunt-ended cDNA as described by Haymerle, et al., Nucleic Acids Research 14:8615, 1986. The non-ligated adaptors were removed by gel filtration chromatography at 68° C., leaving 24 nucleotide non-self-complementary overhangs on the cDNA. The same procedure was used to convert the 5' BglII ends of the mammalian expression vector pDC303 to 24 nucleotide overhangs complementary to those added to the cDNA.

The eukaryotic expression vector pDC303 was designed to express cDNA sequences inserted at its multiple cloning site when transfected into mammalian cells, and also replicates in E. coli. pDC303 was deposited in E. coli cells with the America Type Culture Collection on Feb. 27, 1992, under the designation E. coli DH5α, SF CAV, and was assigned accession no. 68922. pDC303 was assembled from pDC201 (a derivative of pMLSV, previously described by Cosman et al., Nature 312:768, 1984), SV40 and cytomegalovirus DNA and comprises, in sequence: (1) SV40 sequences from coordinates 5171-270 containing the origin of replication, enhancer sequences and early and late promoters; (2) cytomegalovirus sequences containing the promoter and enhancer regions (nucleotides 6711 to +63 from the sequence published by Boechart et al., Cell 41:521, 1985; (3) adenovirus-2 sequences from coordinates 5779-6079 containing sequences for the first exon of the tripartite leader (TPL), coordinates 7101-7172 and 9634-9693 containing the second exon and pan of the third exon of the TPL and a multiple cloning site (MCS) containing sites for XhoI, KpnI, SmaI and BglI; (4) SV40 sequences from coordinates 4127-4100 and 2770-2533 containing the polyadenylation and termination signals for early transcription; (5) adenovirus sequences from coordinates 10532-11156 of the virus-associated RNA genes VAI and VAII; and (6) pBR322 sequences from coordinates 4363-486 and 1094-375 containing the ampicillin resistance gene and origin of replication.

Optimal proportions of adaptoted vector and cDNA were ligated in the presence of T4 polynucleotide kinase. Dialyzed ligation mixtures were electroporated into E. coli strain DH5α and transformants selected on ampicillin plates. Transformed E. coli cells were plated to provide approximately 2,400 colonies per plate and sufficient plates to provide approximately 240,000 total colonies per screen. Colonies were scraped from each plate, pooled, and plasmid DNA prepared from each pool. The pooled DNA was then used to transfect a subconfluent layer of monkey COS-7 cells. COS-7 cells were prepared for transfection by being maintained in complete medium (Dulbecco's modified Eagle's media (DMEM) containing 10% (v/v) fetal calf serum (FCS), 50 U/ml penicillin, 50 U/ml streptomycin, 2 mM L-glutamine) and then plated at a density of 2×10⁵ cells/well in either 6 well dishes (Falcon) or single well chambered slides (Lab-Tek). Both dishes and slides were pretreated with 1 ml human fibronectin (10 μg/ml in PBS) for 30 minutes followed by 1 wash with PBS. Media was removed from the adherent cell layer and replaced with 1.5 ml complete medium containing 134 mM chloroquine sulfate. 0.2 mls of DNA solution (2 μg DNA, 0.5 mg/ml DEAE-dextran in complete medium containing chloroquine) was then added to the cells and incubated for 5 hours. Following the incubation, the media was removed and the cells shocked by addition of complete medium containing 10% DMSO for 21/2 to 20 minutes followed by replacement of the solution with fresh complete medium. The cells were grown in culture to permit transient expression of the inserted sequences. These conditions led to an 80% transfection frequency in surviving COS-7 cells.

After 48 to 72 hours, transfected monolayers of COS-7 cells were assayed for expression of LIF binding proteins by binding radioiodinated LIF (prepared as described above) using the following slide autoradiography technique. Transfected COS-7 cells were washed once with binding medium (RPMI media 1640 containing 25 mg/ml bovine serum albumin (BSA), 2 mg/ml sodium azide, 20 mM HEPES, pH 7.2, and 50 mg/ml nonfat dry milk (NFDM) and incubated for 2 hours at 4° C. with 1 ml binding medium+NFDM containing 1.25×10⁻⁹ ¹²⁵ I-LIF. After incubation, cells in the chambered slides were washed three times with binding buffer+NFDM, followed by 2 washes with PBS, pH 7.3, to remove unbound ¹²⁵ I-LIF. The cells were fixed by incubating for 30 minutes at room temperature in 10% glutaraldehyde in PBS, pH 7.3, washed twice in PBS, and air dried. The slides were dipped in Kodak NTB-2 photographic emulsion (5× dilution in water) and exposed in the dark for 12 hours to 7 days at 4° C. in a light proof box. The slides were then developed for approximately 5 minutes in Kodak D 19 developer (40 g/500 ml water), rinsed in water and fixed in Agfa G433C fixer. The slides were individually examined with a microscope at 25-40× magnification and positive cells expressing LIF-R were identified by the presence of autoradiographic silver grains against a light background.

Cells in the 6 well plates were washed once with binding buffer+NFDM followed by washings with PBS, pH 7.3, to remove unbound ¹²⁵ I-LIF. The bound cells were then trypsinized to remove them from the plate and bound ¹²⁵ I-LIF were counted on a gamma counter.

Using the slide autoradiography approach, approximately 240,000 cDNAs were screened in pools of approximately 2,400 cDNAs until assay of one transfectant pool showed multiple cells clearly positive for LIF binding. This pool was then partitioned into pools of 600 and again screened by slide autoradiography and a positive pool was identified. This pool was further partitioned into pools of 60 and screened against by slide autoradiography until a positive pool was identified. Individual colonies from this pool of 60 were screened until a single clone (clone 65) was identified which directed synthesis of a surface protein with detectable LIF binding activity. This clone was isolated, and its insert is sequenced to determine the sequence of the human LIF-R cDNA clone 65. The pDC303 cloning vector containing the human LIF-R cDNA clone 65 was deposited with the American Type Culture Collection, Rockville, Md., USA (ATCC) on Dec. 11, 1990, under the name pHLIFR-65, and was assigned ATCC Accession Number 68491.

The nucleotide sequence of the cDNA insert of clone 65 is presented, along with the amino acid sequence encoded thereby, in SEQ ID NOS:1 and 2. Amino acids -44 through -1 constitute the signal peptide.

D. Binding to Intact Cells. Binding assays done with DA-1 cells grown in suspension culture were performed by a phthalate oil separation method (Dower et al., J. Immunol. 132:751, 1984) essentially as described by Park et al., J. Biol. Chem 261:4177, 1986 and Park et al., Proc. Natl. Acad. Sci. USA 84:5267, 1987. Nonspecific binding of ¹²⁵ I-LIF was measured in the presence of a 200-fold or greater molar excess of unlabeled LIF. Sodium azide (0.2%) was included in all binding assays to inhibit internalization of ¹²⁵ I-LIF at 37° C. The DA-1 cells bound ¹²⁵ I-LIF, and approximately 200 LIF receptors were determined to be present on the surface cells with an affinity constant (K_(a)) of about 7.4×10⁸ M⁻¹.

Plasmid DNA from LIF receptor expression plasmid was used to transfect a subconfluent layer of monkey COS-7 cells using DEAE-dextran followed by chloroquine treatment, as described by Luthman et al. (Nucl Acids Res. 11:1295, 1983) and McCutchan et al. (J. Natl. Cancer Inst. 41:351, 1968). The cells were then grown in culture for three days to permit transient expression of the inserted sequences. After three days the cell monolayers were assayed for ¹²⁵ I-LIF binding essentially as described by Mosley et al., Cell 59:335, 1989. Non-specific binding of ¹²⁵ I-LIF was measured in the presence of 200-fold or greater excess of unlabeled LIF. Initial binding studies of ¹²⁵ I-LIF to COS cells transfected with LIF-R cDNA clone 65 indicated that high affinity binding (K_(a) >1×10⁹ M⁻¹) was apparent following Scatchard analysis. pDC303 control vector transfected cells indicated that background endogenous LIF receptors are present on COS-7 cells. Control vector transfected cells expressed 130 high-affinity LIF receptors (K_(a) =4.2×10¹⁰ M⁻¹) and 2,400 receptors with lower affinity (K_(a) =7.8×10⁸ M⁻¹). COS-7 cells were transfected with pDC303 containing LIF-R clone 65, and transfected cells were diluted 1:10 in cells that had been transfected with control pDC303 vector. This strategy was utilized as recombinant receptor expression can often be too great to allow accurate determinations of ligand-receptor affinity. Results of these experiments indicate that both affinity classes of LIF receptors were present following transfection with LIF-R clone 65. Approximately 178 high-affinity sites (K_(a) =1.4×10⁻¹¹ M⁻¹) and 9800 lower affinity sites (K_(a) =1.4×10¹¹ M⁻¹) were present on the LIF-R transfectants.

E. ³⁵ S-Labeling and Affinity Purification of LIF-R. COS-7 cells transfected with pDC303 or pDC303 containing the human LIF-R cDNA clone 65 were radiolabeled with ³⁵ S-cysteine/methionine. Detergent extracts of radiolabeled cells were prepared as described (Mosley et al., supra). LIF affinity matrices were prepared by coupling recombinant human LIF to cyanogen bromide-activated Sepharose (Pharmacia) or Hydrazide Affigel (Biorad), according to manufacturer's recommendations. A protein of M_(r) approximately 190,000 was detected following affinity purification with either matrix, and SDS-PAGE analysis of LIF-R clone 65 COS-7 cell lysates, but was undetectable in control vector transfectants. The LIF-R clone 65 cDNA predicts a molecular weight of 111,374 and likely a high degree of glycosylation makes up the difference between this size and the observed M_(r) of 190,000. Additionally, since the clone 65 LIF-R does not contain a stop codon in the 3' end of the cDNA, translation terminates 3 amino acids into the expression vector. Thus 401 daltons of the 190,000 LIF-R protein are encoded by these vector sequences.

F. Assay for Soluble LIF-R Proteins. Subclones derived from the extracellular domain of LIF-R cDNA clone 65 are assayed for the ability to encode soluble LIF-R proteins. COS-7 cells transfected with an expression vector containing the subcloned LIF-R cDNA are cultured to allow expression and secretion of the LIF-R protein. The presence of soluble LIF receptors in COS-7 supernatants is measured by inhibition of [¹²⁵ I]LIF binding to pHLIFR-65 transfected COS-7 cells. Supernatants from control and soluble LIFR subclone transfected COS-7 cells are harvested in DMEM with 0.1% FCS t hree days post-transfection. [¹²⁵ I]LIF binding was assessed as described above in the presence of 0.5 ml conditioned media, or in the presence or absence of 200-fold molar excess unlabeled LIF. Analogous procedures may be employed to test for the presence of other soluble LIF-R proteins (e.g., soluble murine LIF-R) in culture supernatants.

A probe derived from clone 65 was used to isolate additional human LIF-R clones, as described in Example 4. A composite human LIF-R sequence, derived by sequencing and alignment of cDNA and genomic clones, is presented in SEQ ID NOS:5 and 6.

Example 2 Isolation and Purification of cDNA Clones Encoding Murine LIF-R

A murine LIF-R cDNA was isolated from a library made from mouse liver cDNA (Stratagene, San Diego, Cat. #35302), by cross-species hybridization with a human LIF-R probe. A double-stranded human LIF-R probe was produced by excising a BglII fragment of the human LIF-R clone 65 and 32P-labeling the cDNA using random primers (Boehringer-Mannheim). A total of about 5×10⁵ plaques were screened with the human probe in 35% formamide. Murine LIF-R cDNA clone 3 was isolated. This particular clone encoded a soluble version of the LIF receptor. The coding region encodes a LIF-R having about 70% identity at the amino acid level (80% similarity) and about 78% identity at the nucleotide level to the human LIF-R in the region of overlap.

The nucleotide sequence of the murine LIF-R cDNA of clone 3, and the amino acid sequence encoded thereby, are presented in SEQ ID NOS:3 and 4. The protein comprises a signal peptide (amino acids -43 to -1).

Example 3 Preparation of Monoclonal Antibodies to LIF-R

Preparations of purified recombinant LIF-R, for example, human LIF-R, or transfected COS cells expressing high levels of LIF-R are employed to generate monoclonal antibodies against LIF-R using conventional techniques. The immunogen may comprise a LIF-R protein (or fragment thereof, such as the extracellular domain) fused to the peptide Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (DYKDDDDK) (Hoppet al., Bio/Technology 6:1204, 1988 and U.S. Pat. No. 5,011,912) or fused to the Fc portion of an antibody, as described above. Procedures for producing monoclonal antibodies include, for example, those disclosed in U.S. Pat. No. 4,411,993. Such antibodies are likely to be useful in interfering with LIF binding to LIF-R, for example, in ameliorating toxic or other undesired effects of LIF, or as components of diagnostic or research assays for LIF or soluble LIF-R.

To immunize mice, LIF-R immunogen is emulsified in complete Freund's adjuvant and injected in amounts ranging from 10-100 μg subcutaneously into Balb/c mice. Ten to twelve days later, the immunized animals are boosted with additional immunogen emulsified in incomplete Freund's adjuvant and periodically boosted thereafter on a weekly to biweekly immunization schedule. Serum samples are periodically taken by retro-orbital bleeding or tail-tip excision for testing by dot-blot assay (antibody sandwich) or ELISA (enzyme-linked immunosorbent assay). Other assay procedures are also suitable. Following detection of an appropriate antibody titer, positive animals are given an intravenous injection of antigen in saline. Three to four days later, the animals are sacrificed, splenocytes harvested, and fused to the murine myeloma cell line NS. Other suitable known myeloma cell lines may be employed in place of NS1. A prefered murine myeloma cell line is P3x63Ag8.653 (ATCC CRL 1580). Hybridoma cell lines generated by this procedure are plated in multiple microliter plates in a HAT selective medium (hypoxanthine, aminopterin, and thymidine) to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids.

Hybridoma clones thus generated can be screened by ELISA for reactivity with LIF-R, for example, by adaptations of the techniques disclosed by Engvall et al., Immunochem. 8:871 (1971) and in U.S. Pat. No. 4,703,004. A preferred screening technique is the antibody capture technique described in Beckmann et al., (J. Immunol. 144:4212, 1990). Positive clones are then injected into the peritoneal cavities of syngeneic Balb/c mice to produce ascites containing high concentrations (>1 mg/ml) of anti-LIF-R monoclonal antibody. The resulting monoclonal antibody can be purified by ammonium sulfate precipitation followed by gel exclusion chromatography, and/or affinity chromatography based on binding of antibody to Protein A of Staphylococcus aureus.

Example 4 Isolation of Additional Human LIF-R Clones

Additional human LIF-R DNA sequences were isolated by screening human cDNA and genomic libraries with a probe derived from the human LIF-R cDNA isolated in Example 1. Sequencing and alignment of these clones produced the composite human LIF-R sequence presented in SEQ ID NOS:5 and 6.

The entire cDNA insert of pHLIFR-65 (Example 1) was excised by digestion with BglII, radiolabeled using a random priming kit (Stratagene Cloning Systems, La Jolla, Calif.), and used as a hybridization probe to screen the human placental cDNA library from which pHLIFR-65 was derived (Example 1). Hybridization procedures were essentially as described by Goodwin et al., supra, 1989. Positive clones were detected following high stringency washing conditions (0.2×XSSC, 0.1% SDS at 65° C.).

DNA sequences of hybridizing clones were determined using vector- and cDNA-derived oligonucleotide primers on denatured double-stranded templates following shotgun and directed subcloning according to standard procedures (Sambrook, J., Fritsch, E. F. and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, New York).

As shown in SEQ ID NO:1, the cDNA insert in pHLIFR-65 (from Example 1) encoded a single large open reading frame that had no in-frame translation termination signal at its 3' end and, instead, ended in a stretch of 15 adenines (beginning after nucleotide 3143 in SEQ ID NO:1) that were not preceded by a typical polyadenylation signal. The open reading frame was terminated by an in-frame translational stop codon following 15 additional amino acids encoded by the Bgl II adaptors employed in library construction and by the expression vector. The 3' end of each additional isolated cDNA clone coincided with this poly-A stretch. Determination of the complete DNA sequence of one cDNA clone indicated that the sequences upstream of the poly A segment were identical to that of pHLIFR-65. Partial sequences were determined for the other clones, which matched corresponding portions of the pHLIFR-65 cDNA.

Based on the assumption that these cDNAs were the result of oligo(dT) priming at an intemal site in the human LIFR mRNA during construction of the libraries, a human genomic library was screened with both the above-described probe containing the entire pHLIFR cDNA insert, and also with a ³² P-labeled oligonucleotide having the sequence of nucleotides 3099-3115 of SEQ ID NO:1 (near the 3' end of human clone 65). Four hybridizing clones were isolated. A subclone derived from one of the genomic clones (HLIFR-gen1) contained sequence that extended the cDNA sequence beyond the point at which the poly-A stretch of nucleotides began in the cDNA clones.

The sequence of the open reading frame deduced by alignment of pHLIFR-65 cDNA with the 3' genomic sequence (until the first in-frame stop codon was encountered) is presented in FIGS. 2a through 2e and in SEQ ID NOS:5 and 6. In FIG. 2, the signal peptide comprises amino acids -44 to -1. The transmembrane domain is heavily underlined. Potential N-linked glycosylation sites are marked with asterisks. Hallmark residues associated with the hematopoietin family of receptors (Cosman et al., Trends Biochem. Sci., 15:265 (1990) are shown boxed. The horizontal arrow marks the point at which genomic sequence was used to derive the 3' coding region of the HLIFR. All cDNA clones terminated with a stretch of A nucleotides at this point.

Comparison and alignment of the sequences of the positive clones produced the composite map of FIG. 1. In FIG. 1, clones isolated from the placental cDNA and genomic libraries are designated by "p" and "gen", respectively. The HLIFR open reading frame is shown boxed. The signal sequence is shown as a hatched box and the transmembrane domain is shown as a solid box. Some restriction endonuclease cleavage sites are shown.

In order to confirm that the genomic sequence used to complete the amino acid sequence of the HLIFR cytoplasmic domain was exonic, we used a PCR-based approach to detect the contiguous sequence assembled in FIGS. 2a through 2e in human placental cDNA. First strand cDNA was prepared on a human placental mRNA template, using random primers in place of the oligo dT primer suspected of annealing to an internal poly-A site in the previously-described cDNA library. The first strand cDNA was used as a template in a PCR reaction primed with oligonucleotides that span two introns in the HLIFR gene (intron 1 of >700 bp @nt 2770 and intron 2 of >900 bp @nt 2848 in FIG. 2d. The 5' primer employed in the PCR reaction (an oligonucleotide, the 5' end of which is at position 2720 of FIG. 2d) anneals within the transmembrane region of the human LIF-R DNA. One 3' primer (an oligonucleotide having a 5' end at position 3233 of FIG. 2e) anneals downstream of the point at which a poly-A segment is found in the previously isolated cDNAs (but within the coding region). The second 3' primer (an oligonucleotide having a 5' end at position 3529 of FIG. 2e) anneals within the 3' non-coding region of the FIG. 2e LIF-R sequence. The 5' oligonucleotide is based on the sequence of pHLIFR-65 and the 3' oligonucleotides are based on the sequence of the genomic clone. The PCR reaction products were separated by electrophoresis on an agarose gel, then transferred to nitrocellulose. The blot was probed with a 17-mer ³² P-labeled oligonucleotide (nucleotides 3099-3115 of FIG. 2). First strand cDNA synthesis, the polymerase chain reaction and blotting from agarose gels were performed by procedures analogous to those described by Gearing et al., EMBO J. 8:3667-3676 (1989).

Amplification products expected if the composite FIG. 2 DNA sequence is exonic are 513 base pairs (5' primer to first 3' primer) and 809 base pairs (5' primer to the 3' primer in the noncoding region) in length. Specific amplification products of the predicted size were detected following PCR with the cDNA but not with genomic DNA as template. Since no bands were detected in the genomic PCR products it is likely that the distance between the primers (which includes the two introns discussed above) was too great for efficient PCR under the conditions used. The assembled sequence in FIGS. 2a through 2e and SEQ ID NO:5 therefore corresponds to the true sequence of the human LIFR cDNA.

A DNA sequence comprising the full length coding region shown in FIGS. 2a through 2e prepared by a number of different techniques. PCR reaction products produced as described above (with the 5' primer and the 3' primer that anneals in the 3' non-coding region) may be joined to the LIF-R cDNA of pHLIF-R-65 (example 1; contains 5' end of LIF-R sequence) by using a restriction endonuclease that cleaves within the region of overlap with the LIF-R cDNA of pHLIF-R-65. Computer programs that print out restriction sites within a given DNA sequence are known and available. In another approach, the genomic LIF-R DNA isolated above may be substituted for the PCR-amplified DNA, and joined to the pHLIF-R cDNA. Alternatively, the 3' end of the full length human LIF-R sequence may be chemically synthesized by conventional procedures and ligated to the pHLIF-R-65 cDNA (digested with a suitable restriction enzyme). As an additional alternative, a human placental cDNA library prepared using random primers for first strand cDNA synthesis may be screened with a probe derived from pHLIF-R-65 or the genomic clone isolated above to identify a full length cDNA clone.

The extracellular domain of the human LIF-R has homology to members of the hematopoietin receptor family (Cosman et al., supra) and contains two hematopoietin receptor domains (defined from the first conserved Cys residue to the Trp-Ser-X-Trp-Ser motif) and three repeats of a fibronectin type III-like module (FN III). The three FNIII domains span amino acids 487 (Thr) through 789 (Ser) of the SEQ ID NOS:2 and 6 sequences.

The presence of human LIFR cDNA clones in libraries prepared from placenta and liver suggests that the LIFR mRNA is normally expressed in these tissues. In order to define the size of the full-length LIFR mRNA, the cDNA insert of pHLIFR-65 was used to detect LIFR transcripts in human placental RNA. Resolution of RNA samples in agarose gels and transfer to nylon filters was accomplished as described previously (Goodwin et al., supra 1989). Blots were hybridized overnight with the entire insert of pHLIFR-65 that had been radiolabeled using a random priming kit (Stratagene), and washed using high stringency conditions.

Two major RNA species of ˜6 kb and ˜4.5 kb and a minor band of 5 kb were detected. These RNA species may represent alternately spliced transcripts, such as transcripts for membrane bound and soluble forms of the human LIF receptor, or transcripts utilizing different poly(A) addition signals.

Example 5 Soluble Human LIF-R/Fc Homodimer

An expression vector encoding a fragment of the human LIF-R extracellular domain fused to a polypeptide derived from the Fc region of an antibody was constructed as follows. Disulfide bonds form between the Fc portions of the expressed fusion proteins, creating homodimers.

Plasmid pHLIF-R-65, which contains human LIF-R cDNA in expression vector pDC303 as described in example 1, was digested with the restriction enzymes Asp718 and XmnI. Asp718 cleaves the vector upstream of the LIF-R cDNA insert. XmnI is a blunt cutter that cleaves within the codon for amino acid number 702 (Asp) of SEQ ID NO:1, upstream of the transmembrane region. The desired Asp718/XmnI fragment (about 2,444 bp in length) was separated by electrophoresis on an agarose gel and purified by conventional procedures, using an Elutip column.

CDNA encoding a single chain polypeptide derived from the Fc region of a human IgG1 antibody has been cloned into a pBLUESCRIPT SK® vector (Stratagene Cloning Systems, La Jolla, Calif.) to produce a recombinant vector designated hIgG1Fc. A polylinker region comprising a number of restriction sites is positioned immediately upstream of the Fc cDNA. The DNA and encoded amino acid sequences of the cloned Fc cDNA are presented in SEQ ID NO:7 and SEQ ID NO:8 (amino acids 14-245). Amino acids 1-13 of SEQ ID NOS:7 and 8 are encoded by the poly linker DNA segment. FIG. 4 shows the positions of cleavage sites for a number of restriction enzymes in the polylinker and the 5' end of the Fc DNA.

The Fc polypeptide encoded by the cDNA extends from the N-terminal hinge region to the native C-terminus, i.e., is an essentially full-length antibody Fc region. Fc fragments, e.g., those that are truncated at the C-terminal end, also may be employed. The fragments should contain multiple cysteine residues (at least the cysteine residues in the hinge reaction). The antibody from which the Fc polypeptide is derived is preferably of the same species as the patient to be treated with the fusion protein prepared therefrom.

Plasmid hIgG1Fc was digested with Asp718 and StuI, which cleave within the polylinker. The Asp 718/XmnI LIF-R fragment prepared above was ligated into the cleaved hIgG1Fc vector by conventional techniques. StuI and XmnI both produce blunt ends, which will ligate together. In the resulting recombinant vector, the Fc encoding sequence is positioned downstream of, and in the same reading frame as, the LIF-R sequence. The encoded LIF-R/Fc fusion protein comprises amino acids -44 to 702 of SEQ ID NO: 1, followed by amino acids 8 to 245 of SEQ ID NO:7. Amino acids 8 to 13 of SEQ ID NO:7 constitute a peptide linker encoded by the polylinker segment in this fusion protein. E. coli cells were transformed with the ligation mixture and plasmids were isolated therefrom by standard procedures. Plasmid vectors containing the desired DNA insert were identified by restriction endonuclease digestion analysis.

The cloned DNA segment encoding the LIF-R/Fc fusion polypeptide was excised from the recombinant vector by digestion with Asp718 and NotI. The NotI enzyme cleaves the vector in a polylinker region just downstream of the Fc cDNA insert. The excised DNA segment (3.2 kb) is inserted into an appropriate expression vector, depending on the type of host cell that is desired. One suitable expression vector is pDC304, a mammalian expression vector that is virtually identical to pDC303 (ATCC 68922, described in example 1) except that pDC304 contains a NotI site in the multiple cloning site (mcs). pDC304 is designed to express cDNA inserted into the mcs after transfection into mammalian cells.

pDC304 was cleaved with Asp718 and NotI, both of which cleave in the mcs. The LIF-R/Fc-encoding Asp718/NotI DNA fragment prepared above was ligated into the vector. COS-7 (monkey kidney) cells were transfected with the expression vector encoding the LIF-R/Fc fusion. The transfected cells were cultivated to allow expression of the fusion protein comprising the Fc polypeptide fused in frame (via the peptide linker) to the C-terminus of the LIF-R fragment. Disulfide bonds that form between the two Fc regions covalently link the two separate fusion polypeptides into a homodimer comprising two LIF-R polypeptides joined via disulfide bonds between the Fc moieties fused thereto. The LIF-R/Fc homodimer is a soluble protein.

The homodimer receptor protein may be purified using any of a number of conventional protein purification techniques. Since antibody Fc regions bind to protein A and protein G, affinity chromatography employing protein A or protein G attached to an insoluble support material may be employed in the purification process. In one procedure, one liter of culture supernatant containing the receptor is passed over a solid phase protein G column, and the column is then washed thoroughly with phosphate-buffered saline (PBS). The adsorbed Fc-containing fusion protein is eluted with 50 mM glycine buffer, pH 3, and brought to pH 7 with 2M Tris buffer, pH 9. Further purification may involve immunoaffinity column(s), e.g., affinity columns having LIF bound thereto.

In order to confirm dimer formation, COS-7 cells transfected with the hLIF-R/Fc-encoding expression vector were incubated with a mixture of ³⁵ S-methionine and ³⁵ S-cysteine for 3 hours. Duplicate 1-ml aliquots of the culture supernatant were incubated with 50 μl Protein G Sepharose beads (20% v/v, available from Pharmacia) overnight at 4° C. The beads were then pelleted by centrifugation, and protein was recovered from the beads with protein sample buffer ±β-mercaptoethanol (BME). The molecular weight of the recovered protein was analyzed by SDS-PAGE. As expected, a protein band corresponding to the LIF-R/Fc monomer (about 160 kd) was visualized for samples treated with the BME reducing agent. A band corresponding to a protein of about 320 kd (double the monomer size) was seen on the -BME sample gel. No 160 kd (monomer) band was visible on the -BME sample gel. The dimer is believed to form either intracellularly or upon secretion from the transfected cells.

The binding affinity of the homodimeric receptor for LIF was determined by performing a variation of a standard Scatchard analysis. The binding assay procedure was similar to that described by Mosley et al. (Cell 59:335, 1989) except that the receptor is attached to Protein G Sepharose beads, rather than being on the surface of transfected cells, during the assay. The LIF-R/FC fusion protein attached to the beads is believed to be at least predominantly in dimeric form, as indicated above.

COS cells transfected with the hLIF-R/Fc-encoding expression vector were cultivated for 3 days to allow expression and secretion of the hLIF-R/Fc protein. 14 mls of culture supernatant were mixed with 700 μl of 20% (v/v) Protein G Sepharose beads in PBS+0.1% Triton X, and incubated overnight at 4° C. on a rocking platform. The beads were then washed twice with Binding Media (RPMI 1640 medium containing 2.5% bovine serum albumin, 0.2% (v/v) sodium azide and 20 mM Hepes, pH 7.4) and resuspended to 1.7 mls with Binding Media. In a 96-well microtiter plate, samples comprising 50 μl of the resuspended beads plus 50 μBinding Medium plus one of ten 1:2 serial dilutions of ¹²⁵ I-LIF were incubated for 2 hours at 4° C. with shaking.

Tubes containing 250 μnewborn calf serum (NCS) were used in place of the phthalate oil-containing tubes used in the separation method referred to in example 1, section D. Duplicate 50 μsamples from the microtiter plate were applied to the tubes, which were then spun in a microfuge. Tubes were then cut, the radioactivity counted, and processed as for standard Scatchard analysis. The binding affinity of the homodimer for LIF (9×10⁸ M⁻¹) was comparable to that of the LIF-R encoded by clone 65 cDNA (1.1×10⁹ M⁻¹).

In an alternative construct, vector pHLIF-R-65 is cleaved with the restriction enzymes Asp718 and Bsp1286I. Asp718 cleaves in the vector upstream of the LIF-R cDNA insert. Bsp1286I cleaves just 3' of the codon for Val (amino acid 775 in SEQ ID NO: 1). The Asp718/Bsp1286I LIF-R DNA fragment may be fused to an Fc polypeptide-encoding DNA fragment using suitable oligonucleotide linkers if desired. An additional alternative construct may be prepared by digesting hIgG1Fc with Asp718 and BglII. The BglII site shown in FIG. 4 (within the Fc sequence, near the 5' end) is unique. An oligonucleotide may be employed to regenerate the 5' end of the Fc sequence (through the codon for Glu at position 13) and add a suitable restriction site (e.g., for XmnI or Bsp1286I) for joining a LIF-R sequence to the Fc sequence.

Description of the Sequence Listing

SEQ ID NO:1 and SEQ ID NO:2 show the nucleotide sequence and encoded amino acid sequence of human LIF-R clone 65. This particular clone is a 5' fragment, lacking the the 3' end of the coding region. The coding region spans from nucleotides 179-3182. The partial amino acid sequence of the mature peptide encoded by this nucleotide sequence is defined by amino acids 1-957. The predicted signal peptide is defined by amino acids -44 through -1. Though truncated at the C-terminus, the LIF-R protein encoded by clone 65 is capable of binding LIF, as described in Example 1.

SEQ ID NO:3 and SEQ ID NO:4 show the nucleotide sequence and encoded amino acid sequence of murine LIF-R clone 3. This particular clone is a naturally occurring soluble form of the murine LIF-R and has no transmembrane region. The absence of the transmembrane region allows the protein molecule to be transported through the cell membrane. The coding region spans from nucleotides 53-2212. The amino acid sequence of the mature peptide encoded by this nucleotide sequence is defined by amino acids 1-676. The predicted signal peptide is defined by amino acids -43 through -1.

SEQ ID NO:5 and SEQ ID NO:6 show a full length human LIF-R nucleotide sequence and the amino acid sequence encoded thereby. These sequences are composites derived from the sequencing of cDNA and genomic clones, as described in Example 4. The protein comprises a signal sequence (amino acids -44 through -1) followed by an extracellular domain (amino acids 1-789), a transmembrane region (amino acids 790-815), and a cytoplasmic domain (amino acids 816-1054).

SEQ ID NO:7 and SEQ ID NO:8 show the nucleotide sequence and amino acid sequence of a polylinker-encoded peptide (amino acids 1-13) fused to a polypeptide derived from the Fc region of a human IgG1 antibody (amino acids 14-245). The Fc polypeptide extends from the hinge region to the native C-terminus.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (i) APPLICANT: Gearing, David P.                                               Beckmann, M. P.                                                                (ii) TITLE OF INVENTION: Leukemia Inhibitory Factor Receptors                  (iii) NUMBER OF SEQUENCES: 8                                                   (iv) CORRESPONDENCE ADDRESS:                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3182 base pairs                                                    (B) TYPE: nucleic acid                                                          (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (iii) HYPOTHETICAL: N                                                          (iv) ANTI-SENSE: N                                                             (v) FRAGMENT TYPE: N-terminal                                                  (vi) ORIGINAL SOURCE:                                                          (F) TISSUE TYPE-: Placenta                                                     (vii) IMMEDIATE SOURCE:                                                        (B) CLONE: hulifr.65                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                               (B) LOCATION: 179..3182                                                       (D) OTHER INFORMATION:                                                         (ix) FEATURE:                                                                  (A) NAME/KEY: matpeptide                                                       (B) LOCATION: 311..3179                                                        (D) OTHER INFORMATION:                                                         (ix) FEATURE:                                                                  (A) NAME/KEY: sigpeptide                                                       (B) LOCATION: 179..310                                                         (D) OTHER INFORMATION:                                                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        AGATCTTGGAACGAGACGAC CTGCTCTCTCTCCCAGAACGTGTCTCTGCTGCAAGGCACC60                GGGCCCTTTCGCTCTGCAGAACTGCACTTGCAAGACCATTATCAACTCCTAATCCCAGCT120                CAGAAAGGGAGCCTCTGCGACTCATTCATCGCCCTCCAGGACTGACTGCATTGCACAG178                  ATGATGGATATTTACGTATGTTIGAAACGACCATCCTGGATGGTGGAC226                            MetMetAspIleTyrValCysLeuLysArgProSerTrpMetValAsp                               44- 40-35- 30                                                                  AATAAAAGAATGAGGACTGCTTCAAATTTCCAGTGGCTGTTATCAACA274                            AsnLysArgMetArgThrAlaSerAsnPheGlnTruLeuLeuSerThr                               25-20- 15                                                                      TTTATTCTTCTATATCTAATGAATCAAGTAAATAGCCAGAAAAAGGGG322                            PheIleLeuLeuTyrLeuMetAsnGlnValAsnSerGlnLysLysGly                               10-51                                                                          GCT CCTCATGATTTGAAGTGTGTAACTAACAATTTGCAAGTGTGGAAC370                           AlaProHisAspLeuLysCysValThrAsnAsnLeuGlnValTrpAsn                               5101520                                                                         TGTTCTTGGAAAGCACCCTCTGGAACAGGCCGTGGTACTGATTATGAA418                           CysSerTrpLysAlaProSerGlyThrGlyArgGlyThrAspTyrGlu                               253035                                                                         GTTTGCATTGAAAACAGGTCCCGTTCTTGTTATCAGTTGGAGAAAACC466                            ValCysIleGluAsnArgSerArgSerCysTyrGlnLeuGluLysThr                               404550                                                                         A GTATTAAAATTCCAGCTCTTTCACATGGTGATTATGAAATAACAATA514                           SerIleLysIleProAlaLeuSerHisGlyAspTyrGluIleThrIle                               556065                                                                         AATTCT CTACATGATTTTGGAAGTTCTACAAGTAAATTCACACTAAAT562                           AsnSerLeuHisAspPheGlySerSerThrSerLysPheThrLeuAsn                               707580                                                                         GAACAAAACGTTTCC TTAATTCCAGATACTCCAGAGATCTTGAATTTG610                           GluGlnAsnValSerLeuIleProAspThrProGluIleLeuAsnLeu                               859095100                                                                      TCTGCTGATTT CTCAACCTCTACATTATACCTAAAGTGGAACGACAGG658                           SerAlaAspPheSerThrSerThrLeuTyrLeuLysTrpAsnAspArg                               105110115                                                                      GGTTCAGTTT TTCCACACCGCTCAAATGTTATCTGGGAAATTAAAGTT706                           GlySerValPheProHisArgSerAsnValIleTrpGluIleLysVal                               120125130                                                                      CTACGTAAAGAG AGTATGGAGCTCGTAAAATTAGTGACCCACAACACA754                           LeuArgLysGluSerMetGluLeuValLysLeuValThrHisAsnThr                               135140145                                                                      ACTCTGAATGGCAAAGAT ACACTTCATCACTGGAGTTGGGCCTCAGAT802                           ThrLeuAsnGlyLysAspThrLeuHisHisTrpSerTrpAlaSerAsp                               150155160                                                                      ATGCCCTTGGAATGTGCCATTCATTT TGTGGAAATTAGATGCTACATT850                           MetProLeuGluCysAlaIleHisPheValGluIleArgCysTyrIle                               165170175180                                                                   GACAATCTTCATTTTTCTGGTC TCGAAGAGTAGAGTGACTGGAGCCCT898                           AspAsnLeuHisPheSerGlyLeuGluGluTrpSerAspTrpSerPro                               185190195                                                                      GTGAAGAACATTTCTTGGATA CCTGATTCTCAGACTAAGGTTTTTCCT946                           ValLysAsnIleSerTrpIleProAspSerGlnThrLysValPhePro                               200205210                                                                      CAAGATAAAGTGATACTTGTAGGC TCAGACATAACATTTTGTTGTGTG994                           GlnAspLysValIleLeuValGlySerAspIleThrPheCysCysVal                               215220225                                                                      AGTCAAGAAAAAGTGTTATCAGCACTGAT TGGCCATACAAACTGCCCC1042                          SerGlnGluLysValLeuSerAlaLeuIleGlyHisThrAsnCysPro                               230235240                                                                      TTGATCCATCTTGATGGGGAAAATGTTGCAATCAAGA TTCGTAATATT1090                          LeuIleHisLeuAspGlyGluAsnValAlaIleLysIleArgAsnIle                               245250255260                                                                   TCTGTTTCTGCAAGTAGTGGAACAAATGTAGTT TTTACAACCGAAGAT1138                          SerValSerAlaSerSerGlyThrAsnValValPheThrThrGluAsp                               265270275                                                                      AACATATTTGGAACCGTTATTTTTGCTGGATAT CCACCAGATACTCCT1186                          AsnIlePheGlyThrValIlePheAlaGlyTyrProProAsnThrPro                               280285290                                                                      CAACAACTGAATTGTGAGACACATGATTTAAAAGA AATTATATGTAGT1234                          GlnGinLeuAsnCysGluThrHisAspLeuLysGluIleIleCysSer                               295300301                                                                      TGGAATCCAGGAAGGGTGACAGCGTTGGTGGGCCCACGTG CTACAAGC1282                          TrpAsnProGlyArgValThrAlaLeuValGlyProArgAlaThrSer                               310315320                                                                      TACACTTTAGTTGAAAGTTTTTCAGGAAAATATGTTAGACTTAAAAGA 1330                          TyrThrLeuValGluSerPheSerGlyLysTyrValArgLeuLysArg                               325330335340                                                                   GCTGAAGCACCTACAAACGAAAGCTATCAATTATTATTTCAAATG CTT1378                          AlaGluAlaProThrAsnGluSerTyrGlnLeuLeuPheGlnMetLeu                               345350355                                                                      CCAAATCAAGAAATATATAATTTTACTTTGAATGCTCACAATCC GCTG1426                          ProAsnGlnGluIleTyrAsnPheThrLeuAsnAlaHisAsnProLeu                               360365370                                                                      GGTCGATCACAATCAACAATTTTAGTTAATATAACTGAAAAAGTTT AT1474                          GlyArgSerGlnSerThrIleLeuValAsnIleThrGluLysValTyr                               375380385                                                                      CCCCATACTCCTACTTCATTCAAAGTGAAGGATATTAATTCAACAGCT 1522                          ProHisThrProThrSerPheLysValLysAspIleAsnSerThrAla                               390395400                                                                      GTTAAACTTTCTTGGCATTTACCAGGCAACTTTGCAAAGATTAATTTT1570                           ValL ysLeuSerTrpHisLeuProGlyAsnPheAlaLysIleAsnPhe                              405410415420                                                                   TTATGTGAAATTGAAATTAAGAAATCTAATTCAGTACAAGAGCAGCGG1618                            LeuCysGluIleGluIleLysLysSerAsnSerValGlnGluGlnArg                              425430435                                                                      AATGTCACAATCAAAGGAGTAGAAAATTCAAGTTATCTTGTTGCTCTG1666                            AsnValThrIleLysGlyValGluAsnSerSerTyrLeuValAlaLeu                              440445450                                                                      GACAAGTTAAATCCATACACTCTATATACTTTTCGGATTCGTTGTTCT1714                           As pLysLeuAsnProTyrThrLeuTyrThrPheArgIleArgCysSer                              455460465                                                                      ACTGAAACTTTCTGGAAATGGACCAAATGGAGCAATAAAAAACAACAT1762                           ThrGluT hrPheTrpLysTrpSerLysTrpSerAsnLysLysGlnHis                              470475480                                                                      TTAACAACAGAAGCCAGTCCTTCAAAGGGGCCTGATACTTGGAGAGAG1810                           LeuThrThrGluAla SerProSerLysGlyProAsnThrTrpArgGlu                              485490495500                                                                   TGGAGTTCTGATGGAAAAAATTTAATAATCTATTGGAAGCCTTTACCC1858                           TrpSerSerAsp GlyLysAsnLeuIleIleTyrTrpLysProLeuPro                              505510515                                                                      ATTAATGAAGCTAATGGAAAAATACTTTCCTACAATGTATCGTGTTCA1906                           IleAsnGluAl aAsnGlyLysIleLeuSerTyrAsnValSerCysSer                              520525530                                                                      TCAGATGAGGAAACACAGTCCCTTTCTGAAATCCCTGATCCTCAGCAC1954                           SerAspGluGluT hrGlnSerLeuSerGluIleProAspProGlnHis                              535540545                                                                      AAAGCAGAGATACGACTTGATAAGAATGACTACATCATCAGCGTAGTG2002                           LysAlaGluIleArgLeu AspLysAsnAspTyrIleIleSerValVal                              550555560                                                                      GCTAAAAATTCTGTGGGCTCATCACCACCTTCCAAAATAGCGAGTATG2050                           AlaLysAsnSerValGlySerSerPro ProSerLysIleAlaSerMet                              565570575580                                                                   GAAATTCCAAATGATGATCTCAAAATAGAACAAGTTGTTGGGATGGGA2098                           GluIleProAsnAspAspLeuLy sIleGluGlnValValGlyMetGly                              585590595                                                                      AAGGGGATTCTCCTCACCTGGCATTACGACCCCAACATGACTTGCGAC2146                           LysGlyIleLeuLeuThrTrpH isTyrAspProAsnMetThrCysAsp                              600605610                                                                      TACGTCATTAAGTGGTGTAACTCGTCTCGGTCGGAACCATGCCTTATG2194                           TyrValIleLysTrpCysAsnSer SerArgSerGluProCysLeuMet                              615620625                                                                      GACTGGAGAAAAGTTCCCTCAAACAGCACTGAAACTGTAATAGAATCT2242                           AspTrpArgLysValProSerAsnSerThr GluThrValIleGluSer                              630635640                                                                      GATGAGTTTCGACCAGGTATAAGATATAATTTTTTCCTGTATGGATGC2290                           AspGluPheArgProGlyIleArgTyrAsnPhePheLe uTyrGlyCys                              645650655660                                                                   AGAAATCAAGGATATCAATTATTACGCTCCATGATTGGATATATAGAA2338                           ArgAsnGlnGlyTyrGlnLeuLeuArgSerMetI leGlyTyrIleGlu                              665670675                                                                      GAATTGGCTCCCATTGTTGCACCAAATTTTACTGTTGAGGATACTTCT2386                           GluLeuAlaProIleValAlaProAsnPheThr ValGluAspThrSer                              680685690                                                                      GCAGATTCGATATTAGTAAAATGGGAAGACATTCCTGTGGAAGAACTT2434                           AlaAspSerIleLeuValLysTrpGluAspIlePro ValGluGluLeu                              695700705                                                                      AGAGGCTTTTTAAGAGGATATTTGTTTTACTTTGGAAAAGGAGAAAGA2482                           ArgGlyPheLeuArgGlyTyrLeuPheTyrPheGlyLysGl yGluArg                              710715720                                                                      GACACATCTAAGATGAGGGTTTTAGAATCAGGTCGTTCTGACATAAAA2530                           AspThrSerLysMetArgValLeuGluSerGlyArgSerAspIleLys                                725730735740                                                                  GTTAAGAATATTACTGACATATCCCAGAAGACACTGAGAATTGCTGAT2578                           ValLysAsnIleThrAspIleSerGlnLysThrLeuArpIleAla Asp                              745750755                                                                      CTTCAAGGTAAAACAAGTTACCACCTGGTCTTGCGAGCCTATACAGAT2626                           LeuGinGlyLysThrSerTyrHisLeuValLeuArgAlaTyrThr Asp                              760765770                                                                      GGTGGAGTGGGCCCGGAGAAGAGTATGTATGTGGLGACAAAGGAAAAT2674                           GlyGlyValGlyProGluLysSerMetTyrValValThrLysGluAs n                              775780785                                                                      TCTGTGGGATTAATTATTGCCATTCTCATCCCAGTGGCAGTGGCTGTC2722                           SerValGlyLeuIleIleAlaIleLeuIleProValAlaValAlaVal                                790795800                                                                     ATTGTTGGAGTGGUGACAAGTATCCTTTGCTATCGGAAACGAGAATGG2770                           IleValGlyValValThrSerIleLeuCysTyrArgLysArgGluTrp                               805 810815820                                                                  ATTAAAGAAACCTTCTACCCTGATATTCCAAATCCAGAAAACTGTAAA2818                           IleLysGluThrPheTyrProAspIleProAsnProGluAsnCysLys                                825830835                                                                     GCATTACAGTTTCAAAAGAGTGTCTGTGAGGGAAGCAGTGCTCTTAAA2866                           AlaLeuGlnPheGlnLysSerValCysGluGlySerSerAlaLeuLys                                840845850                                                                     ACATTGGAAATGAATCCTTGTACCCCAAATAATGTTGAGGTTCTGGAA2914                           ThrLeuGluMetAsnProCysThrProAsnAsnValGluValLeuGlu                               855 860865                                                                     ACTCGATCAGCATTTCCTAAAATAGAAGATACAGAAATAATTTCCCCA2962                           ThrArgSerAlaPheProLysIleGluAspThrGluIleIleSerPro                               870 875880                                                                     GTAGCTGAGCGTCCTGAAGATCGCTCTGATGCAGAGCCTGAAAACCAT3010                           ValAlaGluArgProGluAspArgSerAspAlaGluProGluAsnHis                               885890 895900                                                                  GTGGTTGTGTCCTATTGTCCACCCATCATTGAGGAAGAAATACCAAAC3058                           ValValValSerTyrCysProProIleIleGluGluGluIleProAsn                               905 910915                                                                     CCAGCCGCAGATGAACCTGGAGGGACTGCACAGGTTATTTACATTGAT3106                           ProAlaAlaAspGluAlaGlyGlyThrAlaGlnValIleTyrIleAsp                               920 925930                                                                     GTTCAGTCGATGTATCAGCCTCAAGCAAAACCAGAAGAAAAAAAAAAA3154                           ValGlnSerMetTyrGlnProGlnAlaLysProGluGluLysLysLys                               935 940945                                                                     AAAAGCAGGTCGTCTCGTTCCAAGATCT3182                                               LysSerArgSerSerArgSerLysIle                                                    950955                                                                         (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                   (A) LENGTH: 1001 amino acids                                                  (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetMetAspIleTyrValCysLeuLysArgProSerTrpMetValAsp                               44- 40-35 -30                                                                  AsnLysArgMetArgThrAlaSerAsnPheGlnTrpLeuLeuSerThr                               25-20- 15                                                                      PheIleLeuLeuTyrLeuMetAsnGlnValAsnSerGlnLysLysGly                               10-51                                                                          AlaProHisAspLeuLysCysValThrAsnAsnLeuGlnValTrpAsn                               5101520                                                                        CysSerTrpLy sAlaProSerGlyThrGlyArgGlyThrAspTyrGlu                              253035                                                                         ValCysIleGluAsnArgSerArgSerCysTyrGlnLeuGluLysThr                               40 4550                                                                        SerIleLysIleProAlaLeuSerHisGlyAspTyrGluIleThrIle                               556065                                                                         AsnSerLeuHisAsnPheGlySerSerThrSerL ysPheThrLeuAsn                              707580                                                                         GluGlnAsnValSerLeuIleProAspThrProGluIleLeuAsnLeu                               859095100                                                                       SerAlaAsnPheSerThrSerThrLeuTyrLeuLysTrpAsnAspArg                              105110115                                                                      GlySerValPheProHisArgSerAsnValIleTrpGluIleLysVal                                120125130                                                                     LeuArgLysGluSerMetGluLeuValLysLeuValThrHisAsnThr                               135140145                                                                      ThrLeuAsnGlyLysAspThrLe uHisHisTrpSerTrpAlaSerAsp                              150155160                                                                      MetProLeuGluCysAlaIleHisPheValGluIleArgCysTyrIle                               165170175 180                                                                  AspAsnLeuHisPheSerGlyLeuGluGluTrpSerAsnTrpSerPro                               185190195                                                                      ValLysAsnIleSerTrpIleProAsnSerGlnThrLysValP hePro                              200205210                                                                      GlnAspLysValIleLeuValGlySerAsnIleThrPheCysCysVal                               215220225                                                                      SerGlnGluLys ValLeuSerAlaLeuIleGlyHisThrAsnCysPro                              230235240                                                                      LeuIleHisLeuAspGlyGluAsnValAlaIleLysIleArgAsnIle                               250255 260                                                                     SerValSerAlaSerSerGlyThrAsnValValPheThrThrGluAsp                               265270275                                                                      AsnIlePheGlyThrValIlePheAlaGlyTyrProProAspThrPr o                              280285290                                                                      GlnGlnLeuAsnCysGluThrHisAspLeuLysGluIleIleCysSer                               295300305                                                                      TrpAsnProGlyArg ValThrAlaLeuValGlyProArgAlaThrSer                              310315320                                                                      TyrThrLeuValGluSerPheSerGlyLysTyrValArgLeuLysArg                               330335 340                                                                     AlaGluAlaProThrAsnGluSerTyrGlnLeuLeuPheGlnMetLeu                               345350355                                                                      ProAsnGlnGluIleTyrAsnPheThrLeuAsnAlaHisAsnProLeu                                360365370                                                                     GlyArgSerGlnSerThrIleLeuValAsnIleThrGluLysValTyr                               375380385                                                                      ProHisThrProThrSerP heLysValLysAspIleAsnSerThrAla                              390395400                                                                      ValLysLeuSerTrpHisLeuProGlyAsnPheAlaLysIleAsnPhe                               410425420                                                                       LeuCysGluIleGluIleLysLysSerAsnSerValGlnGluGlnArg                              425430435                                                                      AsnValThrIleLysGlyValGluAsnSerSerTyrLeuValAlaLeu                                440445450                                                                     AspLysLeuAsnProTyrThrLeuTyrThrPheArgIleArgCysSer                               455460465                                                                      ThrGluThrPheTrpLysTrpSe rLysTrpSerAsnLysLysGlnHis                              470475480                                                                      LeuThrThrGluAlaSerProSerLysGlyProAspThrTrpArgGlu                               490495500                                                                      Trp SerSerAspGlyLysAsnLeuIleIleTyrTryLysProLeuPro                              505510515                                                                      IleAsnGluAlaAsnGlyLysIleLeuSerTyrAsnValSerCysSer                                520525530                                                                     SerAspGluGluThrGlnSerLeuSerGluIleProAspProGlnHis                               535540545                                                                      LysAlaGluIleArgLeuAspLysAsn AspTyrIleIleSerValVal                              550555560                                                                      AlaLysAsnSerValGlySerSerProProSerLysIleAlaSerMet                               565570575 580                                                                  GluIleProAsnAspAspLeuLysIleGluGlnValValGlyMetGly                               585590595                                                                      LysGlyIleLeuLeuThrTrpHisTyrAspProAsnMetThrCysAs p                              600605610                                                                      TyrValIleLysTrpCysAsnSerSerArgSerGluProCysLeuMet                               615620625                                                                      AspTrpArgLysVal ProSerAsnSerThrGluThrValIleGluSer                              630635640                                                                      AspGluPheArgProGlyIleArgTyrAsnPhePheLeuTyrGlyCys                               650655 660                                                                     ArgAsnGlnGlyTyrGlnLeuLeuArgSerMetIleGlyTyrIleGlu                               665670675                                                                      GluLeuAlaProIleValAlaProAsnPheThrValGluAsnThrSer                                680685690                                                                     AlaAspSerIleLeuValLysTrpGluAspIleProValGluGluLeu                               695700705                                                                      ArgGlyPheLeuArgGlyT yrLeuPheTyrPheGlyLysGlyGluArg                              710715720                                                                      AspThrSerLysMetArgValLeuGluSerGlyArgSerAspIleLys                               730735740                                                                       ValLysAsnIleThrAspIleSerGlnLysThrLeuArgIleAlaAsp                              745750755                                                                      LeuGlnGlyLysThrSerTyrHisLeuValLeuArgAlaTyrThrAsp                                760765770                                                                     GlyGlyValGlyProGluLysSerMetTyrValValThrLysGluAsn                               775780785                                                                      SerValGlyLeuIleIleAlaIl eLeuIleProValAlaValAlaVal                              790795800                                                                      IleValGlyValValThrSerIleLeuCysTyrArgLysArgGluTrp                               810815820                                                                      Ile LysGluThrPheTyrProAsnIleProAsnProGluAsnCysLys                              825830835                                                                      AlaLeuGlnPheGlnLysSerValCysGluGlySerSerAlaLeuLys                                840845850                                                                     ThrLeuGluMetAsnProCysThrProAsnAsnValGluValLeuGlu                               855860865                                                                      ThrArgSerAlaPheProLysIleGlu AspThrGluIleIleSerPro                              870875880                                                                      ValAlaGluArgProGluAspArgSerAspAlaGluProGluAsnHis                               885890895 900                                                                  ValValValSerTyrCysProProIleIleGluGluGluIleProAsn                               905910915                                                                      ProAlaAlaAspGluAlaGlyGlyThrAlaGlnValIleTyrIleAs p                              920925930                                                                      ValGlnSerMetTyrGlnProGlnAlaLysProGluGluLysLysLys                               935940945                                                                      LysSerArgSerSer ArgSerLysIle                                                   950955                                                                         (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 2498 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: CDNA to mRNA                                               (iii) HYPOTHETICAL: N                                                          (iv) ANTISENSE: N                                                              ( vii) IMMEDIATE SOURCE:                                                       (B) CLONE: mulifr3                                                             (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 532212                                                           (D) OTHER INFORMATION:                                                         (ix) FEATURE:                                                                  (A) NAME/KEY: matpeptide                                                       (B) LOCATION: 1822209                                                          (D) OTHER INFORMATION:                                                         (ix) FEATURE:                                                                  (A) NAME/KEY: sigpeptide                                                        (B) LOCATION: 53181                                                           (D) OTHER INFORMATION:                                                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        CCCCCTCCGTGGCATTGGCTCCTGCCCAGGGGCTGACTGAACAGCAAGGACAATG55                      Met                                                                            43                                                                             GCAGCTTACTCATGGTGGAGACAGCCATCGTGGATGGTAGACAATAAA103                            AlaAlaTyrSerTrpTrpArgGlnProSerTrpMetValAspAsnLys                               40 -35-30                                                                      AGATCGAGGATGACTCCAAACCTGCCATGGCTCCTGTCAGCTCTGACC151                            ArgSerArgMetThrProAsnLeuProTrpLeuLeuSerAlaLeuThr                               25 -20-15                                                                      CTCCTGCATCTGACGATGCATGCAAACGGTCTGAAGAGAGGGGTACAA199                            LeuLeuHisLeuThrMetHisAlaAsnGlyLeuLysArgGlyValGln                               10-5 15                                                                        GACTTGAAATGCACAACCAACAACATGCGAGTGTGGGACTGCACGTGG247                            AspLeuLysCysThrThrAsnAsnMetArgValTrpAspCysThrTrp                               10 1520                                                                        CCAGCTCCCCTCGGGGTCAGCCCTGGAACTGTTAAAGATATTTGCATT295                            ProAlaProLeuGlyValSerProGlyThrValLysAspIleCysIle                               2530 35                                                                        AAAGACAGGTTCCATTCTTGTCACCCATTAGAGACAACAAACGTTAAA343                            LysAspArgPheHisSerCysHisProLeuGluThrThrAsnValLys                               4045 50                                                                        ATTCCAGCTCTTTCACCTGGTGATCACGAAGTCACAATAAATTATCTA391                            IleProAlaLeuSerProGlyAspHisGluValThrIleAsnTyrLeu                               556065 70                                                                      AATGGCTTTCAGAGTAAATTCACGTTGAATGAAAAAGATGTCTCTTTA439                            AsnGlyPheGlnSerLysPheThrLeuAsnGluLysAspValSerLeu                               7580 85                                                                        ATTCCAGAGACTCCCGAGATCCTGGATTTGTCTGCTGACTTCTTCACC487                            IleProGluThrProGluIleLeuAsnLeuSerAlaAspPhePheThr                               9095 100                                                                       TCCTCCTTACTACTGAAGTGGAACGACAGAGGGTCTGCTCTGCCTCAC535                            SerSerLeuLeuLeuLysTrpAsnAspArgGlySerAlaLeuProHis                               1051101 15                                                                     CCCTCCAATGCCACCTGGGAGATTAAGGTTCTACAGAATCCAAGGACG583                            ProSerAsnAlaThrTrpGluIleLysValLeuGlnAsnProArgThr                               120125130                                                                      GAACC AGTAGCACTCGTGTTACTCLACACAATGCTGAGTGGTAAAGAT631                           GluProValAlaLeuValLeuLeuAsnThrMetLeuSerGlyLysAsp                               135140145150                                                                   A CCGTTCAGCACTGGAACTGGACCTCAGACCTGCCCTTGCAATGTGCC679                           ThrValGlnHisTrpAsnTrpThrSerAspLeuProLeuGlnCysAla                               155160165                                                                       ACTCACTCGGTGAGCATTCGATGGCACATTGACTCGCCTCATTTCTCC727                           ThrHisSerValSerIleArgTrpHisIleAspSerProHisPheSer                               170175180                                                                      GGT TACAAAGAGTGGAGTGACTGGAGCCCGCTGLAGAACATCTCCTGG775                           GlyTyrLysGluTrpSerAspTrpSerProLeuLysAsnIleSerTrp                               185190195                                                                      ATTCGTAA TACAGAGACTAATGTTTTTCCTCAAGACAAAGTGGTGCTC823                           IleArgAsnThrGluThrAsnValPheProGlnAsnLysValValLeu                               200205210                                                                      GCAGGCTCAAACATGA CAATTTGTTGTATGAGTCCAACGAAAGTGCTT871                           AlaGlySerAsnMetThrIleCysCysMetSerProThrLysValLeu                               215220225230                                                                   TCAGGACAGATC GGCAATACCCTTCGTCCTCTCATCCATCTGTACGGG919                           SerGlyGlnIleGlyAsnThrLeuArgProLeuIleHisLeuTyrGly                               235240245                                                                      CAAACCGTTGCG ATCCATATCCTGAACATCCCCGTTTCTGAAAACAGT967                           GlnThrValAlaIleHisIleLeuAsnIleProValSerGluAsnSer                               250255260                                                                      GGCACAAACATCAT TTTCATCACAGACGACGATGTGTACGGAACGGTG1015                          GlyThrAsnIleIlePheIleThrAspAspAspValTyrGlyThrVal                               265270275                                                                      GTCTTTGCAGGCTATCCTC CCGATGTTCCTCAGAAGCTGAGCTGTGAG1063                          ValPheAlaGlyTyrProProAspValProGlnLysLeuSerCysGlu                               280285290                                                                      ACACATGACTTAAAAGAGATTATATGT AGCTGGAATCCAGGAAGGATA1111                          ThrHisAspLeuLysGluIleIleCysSerTrpAsnProGlyArgIle                               295300305310                                                                   ACTGGACTGGTGGGCCCACGAAAT ACAGAATACACCCTGTTTGAAAGC1159                          ThrGlyLeuValGlyProArgAsnThrGluTyrThrLeuPheGluSer                               315320325                                                                      ATTTCAGGAAAATCGGCAGTATA TCACAGGATTGAAGGACTTACAAAC1207                          IleSerGlyLysSerAlaValPheHisArgIleGluGlyLeuThrAsn                               330335340                                                                      GAGACCTACCGGTTAGGCGTGCAAA TGCATCCCGGCCAAGAAATCCAT1255                          GluThrTyrArgLeuGlyValGlnMetHisProGlyGlnGluIleHis                               345350355                                                                      AACTTCACCCTGACTGGTCGCAATCCACTG GGGCAGGCACAGTCAGCA1303                          AsnPheThrLeuThrGlyArgAsnProLeuGlyGlnAlaGlnSerAla                               360365370                                                                      GTGGTCATCAATGTGACTGAGAGAGTTGCTCCTCATGAT CCGACTTCG1351                          ValValIleAsnValThrGluArgValAlaProHisAspProThrSer                               375380385390                                                                   TTGAAAGTGAAGGACATCAATTCAACAGTTGTTAC ATTTTCTTGGTAT1399                          LeuLysValLysAspIleAsnSerThrValValThrPheSerTrpTyr                               395400405                                                                      TTACCAGGAAATTTTACAAAGATTAATCTTTTAT GTCAAATTGAAATT1447                          LeuProGlyAsnPheThrLysIleAsnLeuLeuCysGlnIleGluIle                               410415420                                                                      TGTALAGCTAATTCCAAGAAAGAAGTGAGGAATGCC ACAATCAGAGGA1495                          CysLysAlaAsnSerLysLysGluValArgAsnAlaThrIleArgGly                               425430435                                                                      GCCGAGGATTCAACTTACCATGTTGCTGTAGACAAATTAAAT CCATAC1543                          AlaGluAspSerThrTyrHisValAlaValAspLysLeuAsnProTyr                               440445450                                                                      ACTGCATACACTTTCCGGGTTCGTTGTTCTTCCAAGACTTTCTGGAAG 1591                          ThrAlaTyrThrPheArgValArgCysSerSerLysThrPheTrpLys                               455460465470                                                                   TGGAGCAGGTGGAGTGATGAGAAGCGACATCTAACCACAGAAGCCA CT1639                          TrpSerArgTrpSerAspGluLysArgHisLeuThrThrGluAlaThr                               475490485                                                                      CCTTCAAAGGGACCAGACACTTGGAGAGAGTGGAGTTCTGATGGA AAA1687                          ProSerLysGlyProAspThrTrpArgGluTrpSerSerAspGlyLys                               490495500                                                                      AATCTAATCGTCTACTGGAAGCCTTTACCTATTAATGAAGCTAATGGA 1735                          AsnLeuIleValTyrTrpLysProLeuProIleAsnGluAlaAsnGly                               505510515                                                                      AAAATACTTTCCTACAATGTTTCGTGTTCATTGAACGAGGAGACACAG17 83                          LysIleLeuSerTyrAsnValSerCysSerLeuAsnGluGluThrGln                               520525530                                                                      TCAGTTTTGGAGATCTTCGATCCTCAACACAGAGCAGAGATACAGCTT1831                           SerVal LeuGluIlePheAspProGlnHisArgAlaGluIleGlnLeu                              535540545550                                                                   AGTAAAAATGACTACATCATCAGTGTGGTGGCAAGAAATTCTGCTGGC1879                           Se rLysAsnAspTyrIleIleSerValValAlaArgAsnSerAlaGly                              555560565                                                                      TCATCACCACCTTCGAAAATAGCTAGTATGGAAATCCCAAATGACGAC1927                           S erSerProProSerLysIleAlaSerMetGluIleProAsnAspAsp                              570575580                                                                      ATCACAGTAGAGCAAGCGGTGGGGCTAGGAAACACGATCTTCCTCACC1975                           Ile ThrValGluGlnAlaValGlyLeuGlyAsnArgIlePheLeuThr                              585590595                                                                      TGGCGTCACGACCCCAACATGACTTGTGACTACGTAATTAAATGGTCC2023                           TrpArgHis AspProAsnMetThrCysAspTyrValIleLysTrpCys                              600605610                                                                      AACTCATCTCGGTCTGAGCCCTGCCTCCTGGACTGGAGAAAGGTTCCT2071                           AsnSerSerArgSerGl uProCysLeuLeuAspTrpArgLysValPro                              620625630                                                                      TCALACAGCACGGAGACTGTCATAGAGTCTGATCAGTTTCAGCCAGGA2119                           SerAsnSerThrGluThrValIleGluS erAspGlnPheGlnProGly                              635640645                                                                      GTAAGATACAACTTTTACCTCTATGGGTGCACTAACCAGCGATACCAA2167                           ValArgTyrAsnPheTyrLeuTyrGly CysThrAsnGlnGlyTyrGln                              650655660                                                                      CTGTTACGTTCCATAATTGGATACGTAGAAGAACTGGAAGCTTAA2212                              LeuLeuArgSerIleIleGlyTyrValGlu GluLeuGluAla                                    665670675                                                                      AAACTTGGAAATGTATCCAGGCCTAACACCAGAGAGGGGAGTATCCCTGAAGTCTGTTTG2272               AGCGGTCACTTAAAATATGCGGCACATGGGGGGCTGGAGAGAAGGCACC GACTGCTCTTC2332              CAGAGGTCCTGAGTTCAATTCCCAGCAACCACATGGTGACTCACAACCATCTGTAATGGG2392               GTCTGGTGCCCTCTTCTGGTGTGTCTGAAGAGAGCAATGGTGGCATACTCATATGTATAA2452               AATAAATAAATAAAACTTTTTAAAAA ACCAAAAAAAAAAAAAAAAA2498                            (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 719 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        MetAlaAlaTyrSerTrpTrpArgGlnPr oSerTryMetValAspAsn                              43-40-35- 30                                                                   LysArgSerArgMetThrProAsnLeuProTrpLeuLeuSerAlaLeu                               25-20- 15                                                                      ThrLeuLeuHisLeuThrMetHisAlaAsnGlyLeuLysArgGlyVal                               10- 515                                                                        GlnAspLeuLysCysThrThrAsnAsnMetArgValTroAspCysThr                               101520                                                                         TrpProAlaProLeuGlyValSerProGlyThrValLysAspIleCys                               253035                                                                         IleLysAspA rgPheHisSerCysHisProLeuGluThrThrAsnVal                              404550                                                                         LysIleProAlaLeuSerProGlyAspHisGluValThrIleAsnTyr                               5560 65                                                                        LeuAsnGlyPheGlnSerLysPheThrLeuAsnGluLysAspValSer                               70758085                                                                       LeuIleProGluThrProGluIleLeuAspLeuSer AlaAspPhePhe                              9095100                                                                        ThrSerSerLeuLeuLeuLysTrpAsnAspArgGlySerAlaLeuPro                               105110115                                                                      HisProSerAsnAlaThrTrpGluIleLysValLeuGlnAsnProArg                               120125130                                                                      ThrGluProValAlaLeuValLeuLeuAsnThrMetLeuSerGlyLys                               135 140145                                                                     AspThrValGlnHisTrpAsnTrpThrSerAspLeuProLeuGlnCys                               150155160165                                                                   AlaThrHisSerValSerIleArgT rpHisIleAsnSerProHisPhe                              170175180                                                                      SerGlyTyrLysGluTrpSerAspTrpSerProLeuLysAsnIleSer                               185190 195                                                                     TrpIleArgAsnThrGluThrAsnValPheProGlnAspLysValVal                               200205210                                                                      LeuAlaGlySerAsnMetThrIleCysCysMetSerProThrLysVal                                215220225                                                                     LeuSerGlyGlnIleGlyAsnThrLeuArgProLeuIleHisLeuTyr                               230235240245                                                                   GlyGlnThrValAl aIleHisIleLeuAsnIleProValSerGluAsn                              250255260                                                                      SerGlyThrAsnIleIlePheIleThrAspAspAspValTyrGlyThr                               265 270275                                                                     ValValPheAlaGlyTyrProProAspValProGlnLysLeuSerCys                               280285290                                                                      GluThrHisAspLeuLysGluIleIleCysSerTrpA snProGlyArg                              295300305                                                                      IleThrGlyLeuValGlyProArgAsnThrGluTyrThrLeuPheGlu                               310315320325                                                                   Ser IleSerGlyLysSerAlaValPheHisArgIleGluGlyLeuThr                              330335340                                                                      AsnGluThrTyrArgLeuGlyValGlnMetHisProGlyGlnGluIle                                345350355                                                                     HisAsnPheThrLeuThrGlyArgAsnProLeuGlyGlnAlaGlnSer                               360365370                                                                      AlaValValIleAsnValThrGluAr gValAlaProHisAspProThr                              375380385                                                                      SerLeuLysValLysAspIleAsnSerThrValValThrPheSerTrp                               390395400 405                                                                  TyrLeuProGlyAsnPheThrLysIleAsnLeuLeuCysGlnIleGlu                               410415420                                                                      IleCysLysAlaAsnSerLysLysGluValArgAsnAlaThrIleA rg                              425430435                                                                      GlyAlaGluAspSerThrTyrHisValAlaValAspLysLeuAsnPro                               440445450                                                                      TyrThrAlaTyrThr PheArgValArgCysSerSerLysThrPheTrp                              455460465                                                                      LysTrpSerArgTrpSerAspGluLysArgHisLeuThrThrGluAla                               470475 480485                                                                  ThrProSerLysGlyProAspThrTrpArgGluTrpSerSerAspGly                               490495500                                                                      LysAsnLeuIleValTyrTrpLysProLeuProIl eAsnGluAlaAsn                              505510515                                                                      GlyLysIleLeuSerTyrAsnValSerCysSerLeuAsnGluGluThr                               520525530                                                                      Gln SerValLeuGluIlePheAsnProGlnHisArgAlaGluIleGln                              535540545                                                                      LeuSerLysAsnAspTyrIleIleSerValValAlaArgAsnSerAla                               550555 560565                                                                  GlySerSerProProSerLysIleAlaSerMetGluIleProAsnAsp                               570575580                                                                      AspIleThrValGluGlnAlaVal GlyLeuGlyAsnArgIlePheLeu                              585590595                                                                      ThrTrpArgHisAspProAsnMetThrCysAspTyrValIleLysTrp                               600605 610                                                                     CysAsnSerSerArgSerGluProCysLeuLeuAspTrpArgLysVal                               615620625                                                                      ProSerAsnSerThrGluThrValIleGluSerAspGlnPheGlnPro                               630 635640645                                                                  GlyValArgTyrAsnPheTyrLeuTyrGlyCysThrAsnGlnGlyTyr                               650655660                                                                      GlnLeuLeuArg SerIleIleGlyTyrValGluGluLeuGluAla                                 665670675                                                                      (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3591 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: CDNA to mRNA                                               (iii) HYPOTHETICAL: NO                                                         (iv) ANTISENSE: NO                                                             (vii) IMMEDIATE SOURCE:                                                        (B) CLONE: hulifr.65- gen                                                      (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 179..3492                                                        (ix) FEATURE:                                                                  (A) NAME/KEY: matpeptide                                                       (B) LOCATION: 311..3469                                                        (ix) FEATURE:                                                                   (A) NAME/KEY: sigpeptide                                                      (B) LOCATION: 179..310                                                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        AGATCTTGGAACGAGACGACCTGCTCTCTCTCCCAGAACGTGTCTCTGCTGCAAGGCACC60                 GGGCCCTTTCGCTCTGCAGAACTGCACTTGCAAGACCATTATCAACTCCTAATCCCAGCT120                CAGAA AGGGAGCCTCTGCGACTCATTCATCGCCCTCCAGGACTGACTGCATTGCACAG178                 ATGATGGATATTTACGTATGTTTGAAACGACCATCCTGGATGGTGGAC226                            MetMetAspIleTyrValCysLeuLysArgProSerTrpMet ValAsp                              44- 40-35-30                                                                   AATAAAAGAATGAGGACTGCTTCAAATTTCCAGTGGCTGTTATCAACA274                            AsnLysArgMetArgThrAlaSerAsnPheGlnTrpLeuLe uSerThr                              25-20- 15                                                                      TTTATTCTTCTATATCTAATGAATCAAGTAAATAGCCAGAAAAAGGGG322                            PheIleLeuLeuTyrLeuMetAsnGlnValAsnSerGlnLys LysGly                              10-51                                                                          GCTCCTCATGATTTGAAGTGTGTAACTAACAATTTGCAAGTGTGGAAC370                            AlaProHisAspLeuLysCysValThrAsnAsnLeuGlnValTrpAsn                               5101520                                                                        TGTTCTTGGAAAGCACCCTCTGGAACAGGCCGTGGTACTGATTATGAA418                            CysSerTrpLysAlaProSerGlyThrGlyArgGlyThrAspTy rGlu                              253035                                                                         GTTTGCATTGAAAACAGGTCCCGTTCTTGTTATCAGTTGGAGAAAACC466                            ValCysIleGluAsnArgSerArgSerCysTyrGlnLeuGluL ysThr                              404550                                                                         AGTATTAAAATTCCAGCTCTTTCACATGGTGATTATGAAATAACAATA514                            SerIleLysIleProAlaLeuSerHisGlyAspTyrGluIleThr Ile                              556065                                                                         AATTCTCTACATGATTTTGGAAGTTCTACAAGTAAATTCACACTAAAT562                            AsnSerLeuHisAspPheGlySerSerThrSerLysPheThrLeuAsn                                707580                                                                        GAACAAAACGTTTCCTTAATTCCAGATACTCCAGAGATCTTGAATTTG610                            GluGlnAsnValSerLeuIleProAspThrProGluIleLeuAsnLeu                               85 9095100                                                                     TCTGCTGATTTCTCAACCTCTACATTATACCTAAAGTGGAACGACAGG658                            SerAlaAspPheSerThrSerThrLeuTyrLeuLysTrpAsnAspArg                                105110115                                                                     GGTTCAGTTTTTCCACACCGCTCAAATGTTATCTGGGAAATTAAAGTT706                            GlySerValPheProHisArgSerAsnValIleTrpGluIleLysVal                                120125130                                                                     CTACGTAAAGAGAGTATGGAGCTCGTAAAATTAGTGACCCACAACACA754                            LeuArgLysGluSerMetGluLeuValLysLeuValThrHisAsnThr                               13 5140145                                                                     ACTCTGAATGGCAAAGATACACTTCATCACTGGAGTTGGGCCTCAGAT802                            ThrLeuAsnGlyLysAspThrLeuHisHisTrpSerTrpAlaSerAsp                               150 155160                                                                     ATGCCCTTGGAATGTGCCATTCATTTTGTGGAAATTAGATGCTACATT850                            MetProLeuGluCysAlaIleHisPheValGluIleArgCysTyrIle                               165170 175180                                                                  GACAATCTTCATTTTTCTGGTCTCGAAGAGTGGAGTGACTGGAGCCCT898                            AspAsnLeuHisPheSerGlyLeuGluGluTrpSerAspTrpSerPro                               185 190195                                                                     GTGAAGAACATTTCTTGGATACCTGATTCTCAGACTAAGGTTTTTCCT946                            ValLysAsnIleSerTrpIleProAspSerGlnThrLysValPhePro                               200 205210                                                                     CAAGATAAAGTGATACTTGTAGGCTCAGACATAACATTTTGTTGTGTG994                            GlnAspLysValIleLeuValGlySerAspIleThrPheCysCysVal                               215 220225                                                                     AGTCAAGAAAAAGTGTTATCAGCACTGATTGGCCATACAAACTGCCCC1042                           SerGlnGluLysValLeuSerAlaLeuIleGlyHisThrAsnCysPro                               230235 240                                                                     TTGATCCATCTTGATGGGGAAAATGTTGCAATCAAGATTCGTAATATT1090                           LeuIleHisLeuAspGlyGluAsnValAlaIleLysIleArgAsnIle                               245250255 260                                                                  TCTGTTTCTGCAAGTAGTGGAACAAATGTAGTTTTTACAACCGAAGAT1138                           SerValSerAlaSerSerGlyThrAsnValValPheThrThrGluAsp                               26527 0275                                                                     AACATATTTGGAACCGTTATTTTTGCTGGATATCCACCAGATACTCCT1186                           AsnIlePheGlyThrValIlePheAlaGlyTyrProProAspThrPro                               280285 290                                                                     CAACAACTGAATTGTGAGACACATGATTTAAAAGAAATTATATGTAGT1234                           GlnGlnLeuAsnCysGluThrHisAspLeuLysGluIleIleCysSer                               295300 305                                                                     TGGAATCCAGGAAGGGTGACAGCGTTGGTGGGCCCACGTGCTACAAGC1282                           TrpAsnProGlyArgValThrAlaLeuValGlyProArgAlaThrSer                               310315320                                                                      TACACTTTAGTTGAAAGTTTTTCAGGAAAATATGTTAGACTTAAAAGA1330                           TyrThrLeuValGluSerPheSerGlyLysTyrValArgLeuLysArg                               325330335 340                                                                  GCTGAAGCACCTACAAACGAAAGCTATCAATTATTATTTCAAATGCTT1378                           AlaGluAlaProThrAsnGluSerTyrGlnLeuLeuPheGlnMetLeu                               345350 355                                                                     CCAAATCAAGAAATATATAATTTTACTTTGAATGCTCACAATCCGCTG1426                           ProAsnGlnGluIleTyrAsnPheThrLeuAsnAlaHisAsnProLeu                               360365 370                                                                     GGTCGATCACAATCAACAATTTTAGTTAATATAACTGAAAAAGTTTAT1474                           GlyArgSerGlnSerThrIleLeuValAsnIleThrGluLysValTyr                               375380385                                                                      C CCCATACTCCTACTTCATTCAAAGTGAAGGATATTAATTCAACAGCT1522                          ProHisThrProThrSerPheLysValLysAspIleAsnSerThrAla                               390395400                                                                      GTTAAACTT TCTTGGCATTTACCAGGCAACTTTGCAAAGATTAATTTT1570                          ValLysLeuSerTrpHisLeuProGlyAsnPheAlaLysIleAsnPhe                               405410415420                                                                   TTATGT GAAATTGAAATTAAGAAATCTAATTCAGTACAAGAGCAGCGG1618                          LeuCysGluIleGluIleLysLysSerAsnSerValGlnGluGlnArg                               425430435                                                                      AATGT CACAATCAAAGGAGTAGAAAATTCAAGTTATCTTGTTGCTCTG1666                          AsnValThrIleLysGlyValGluAsnSerSerTyrLeuValAlaLeu                               440445450                                                                      GACAAGT TAAATCCATACACTCTATATACTTTTCGGATTCGTTGTTCT1714                          AspLysLeuAsnProTyrThrLeuTyrThrPheArgIleArgCysSer                               455460465                                                                      ACTGAAACTTTC TGGAAATGGAGCAAATGGAGCAATAAAAAACAACAT1762                          ThrGluThrPheTrpLysTrpSerLysTrpSerAsnLysLysGlnHis                               470475480                                                                      TTAACAACAGAAGCCAGTCCT TCAAAGGGGCCTGATACTTGGAGAGAG1810                          LeuThrThrGluAlaSerProSerLysGlyProAspThrTrpArgGlu                               485490495500                                                                   TGGAGTTCTGATGGAAA AAATTTAATAATCTATTGGAAGCCTTTACCC1858                          TrpSerSerAspGlyLysAsnLeuIleIleTyrTrpLysProLeuPro                               505510515                                                                      ATTAATGAAGCTAATG GAAAAATACTTTCCTACAATGTATCGTGTTCA1906                          IleAsnGluAlaAsnGlyLysIleLeuSerTyrAsnValSerCysSer                               520525530                                                                      TCAGATGAGGAAACACAG TCCCTTTCTGAAATCCCTGATCCTCAGCAC1954                          SerAspGluGluThrGlnSerLeuSerGluIleProAspProGlnHis                               535540545                                                                      AAAGCAGAGATACGACTTGATAAG AATGACTACATCATCAGCGTAGTG2002                          LysAlaGluIleArgLeuAspLysAsnAspTyrIleIleSerValVal                               550555560                                                                      GCTAAAAATTCTGTGGGCTCATCACCACCTTC CAAAATAGCGAGTATG2050                          AlaLysAsnSerValGlySerSerProProSerLysIleAlaSerMet                               565570575580                                                                   GAAATTCCAAATGATGATCTCAAAATAG AACAAGTTGTTGGGATGGGA2098                          GluIleProAsnAspAspLeuLysIleGluGlnValValGlyMetGly                               585590595                                                                      AAGGGGATTCTCCTCACCTGGCATTAC GACCCCAACATGACTTGCGAC2146                          LysGlyIleLeuLeuThrTrpHisTyrAspProAsnMetThrCysAsp                               600605610                                                                      TACGTCATTAAGTGGTGTAACTCGTCTCGG TCGGAACCATGCCTTATG2194                          TyrValIleLysTrpCysAsnSerSerArgSerGluProCysLeuMet                               615620625                                                                      GACTGGAGAALAGTTCCCTCAAACAGCACTGAAAC TGTAATAGAATCT2242                          AspTrpArgLysValProSerAsnSerThrGluThrValIleGluSer                               630635640                                                                      CATGAGTTTCGACCAGGTATAAGATATAATTTTTTCCTGTATG GATGC2290                          AspGluPheArgProGlyIleArgTyrAsnPhePheLeuTyrGlyCys                               645650655660                                                                   AGAAATCAAGGATATCAATTATTACGCTCCATGATTGGA TATATAGAA2338                          ArgAsnGlnGlyTyrGlnLeuLeuArgSerMetIleGlyTyrIleGlu                               665670675                                                                      GAATTGGCTCCCATTGTTGCACCAAATTTTACTGTTGAG GATACTTCT2386                          GluLeuAlaProIleValAlaProAsnPheThrValGluAspThrSer                               680685690                                                                      GCAGATTCGATATTAGTAAAATGGGAAGACATTCCTGTGGA AGAACTT2434                          AlaAspSerIleLeuValLysTrpGluAspIleProValGluGluLeu                               695700705                                                                      AGAGGCTTTTTAAGAGGATATTTGTTTTACTTTGGAAAAGGAGAAA GA2482                          ArgGlyPheLeuArgGlyTyrLeuPheTyrPheGlyLysGlyGluArg                               710715720                                                                      GACACATCTAAGATGAGGGTTTTAGAATCAGGTCGTTCTGACATAAAA2530                           AspThrSerLysMetArgValLeuGluSerGlyArgSerAspIleLys                               725730735740                                                                   GTTAAGAATATTACTGACATATCCCAGAAGACACTGAGAATTGCTGAT 2578                          ValLysAsnIleThrAspIleSerGlnLysThrLeuArgIleAlaAsp                               745750755                                                                      CTTCAAGGTAAAACAAGTTACCACCTGGTCTTGCGAGCCTATACAGAT 2626                          LeuGlnGlyLysThrSerTyrHisLeuValLeuArgAlaTyrThrAsp                               760765770                                                                      GGTGGAGTGGGCCCGGAGAAGAGTATGTATGTGGTGACAAAGGAAAAT2 674                          GlyGlyValGlyProGluLysSerMetTyrValValThrLysGluAsn                               775780785                                                                      TCTGTGGGATTAATTATTGCCATTCTCATCCCAGTGGCAGTGGCTGTC2722                           Se rValGlyLeuIleIleAlaIleLeuIleProValAlaValAlaVal                              790795800                                                                      ATTGTTGGAGTGGTGACAAGTATCCTTTGCTATCGGAAACGAGAATGG2770                           IleValGlyV alValThrSerIleLeuCysTyrArgLysArgGluTrp                              805810815820                                                                   ATTAAAGAAACCTTCTACCCTGATATTCCAAATCCAGAAAACTGTAAA2818                           IleLys GluThrPheTyrProAspIleProAsnProGluAsnCysLys                              825830835                                                                      GCATTACAGTTTCAAAAGAGTGTCTGTGAGGGAAGCAGTGCTCTTAAA2866                           AlaLeu GlnPheGlnLysSerValCysGluGlySerSerAlaLeuLys                              840845850                                                                      ACATTGGAAATGAATCCTTGTACCCCAAATAATGTTGAGGTTCTGGAA2914                           ThrLeuGl uMetAsnProCysThrProAsnAsnValGluValLeuGlu                              855860865                                                                      ACTCGATCAGCATTTCCTAAAATAGAAGATACAGAAATAATTTCCCCA2962                           ThrArgSerAlaP heProLysIleGluAspThrGluIleIleSerPro                              870875880                                                                      GTAGCTGAGCGTCCTGAAGATCGCTCTGATGCAGAGCCTGAAAACCAT3010                           ValAlaGluArgProGluAsp ArgSerAspAlaGluProGluAsnHis                              885890895900                                                                   GTGGTTGTGTCCTATTGTCCACCCATCATTGAGGAAGAAATACCAAAC3058                           ValValValSerTyrCys ProProIleIleGluGluGluIleProAsn                              905910915                                                                      CCAGCCGCAGATGAAGCTGGAGGGACTGCACAGGTTATTTACATTGAT3106                           ProAlaAlaAspGluAl aGlyGlyThrAlaGlnValIleTyrIleAsp                              920925930                                                                      GTTCAGTCGATGTATCAGCCTCAAGCAAAACCAGAAGAAGAACAAGAA3154                           ValGlnSerMetTyrGlnP roGlnAlaLysProGluGluGluGlnGlu                              935940945                                                                      AATGACCCTGTAGGAGGGGCAGGCTATAAGCCACAGATGCACCTCCCC3202                           AsnAspProValGlyGlyAlaGly TyrLysProGlnMetHisLeuPro                              950955960                                                                      ATTAATTCTACTGTGGAAGATATAGCTGCAGAAGAGGACTTAGATAAA3250                           IleAsnSerThrValGluAspIleAlaAlaGlu GluAspLeuAspLys                              965970975980                                                                   ACTGCGGGTTACAGACCTCAGGCCAATGTAAATACATGGAATTTAGTG3298                           ThrAlaGlyTyrArgProGlnAlaAsnVa lAsnThrTrpAsnLeuVal                              985990995                                                                      TCTCCAGACTCTCCTAGATCCATAGACAGCAACAGTGAGATTGTCTCA3346                           SerProAspSerProArgSerIleAspS erAsnSerGluIleValSer                              100010051010                                                                   TTTGGAAGTCCATGCTCCATTAATTCCCGACAATTTTTGATTCCTCCT3394                           PheGlySerProCysSerIleAsnSer ArgGlnPheLeuIleProPro                              101510201025                                                                   AAAGATGAAGACTCTCCTAAATCTAATGGAGGAGGGTGGTCCTTTACA3442                           LysAspGluAspSerProLysSerAsnG lyGlyGlyTrpSerPheThr                              103010351040                                                                   AACTTTTTTCAGLACAAACCAAACGATTAACAGTGTCACCGTGTCAC3489                            AsnPhePheGlnAsnLysProAsnAsp                                                    1045 1050                                                                      TTCAGTCAGCCATCTCAATAAGCTCTTACTGCTAGTGTTGCTACATCAGCACTGGGCATT3549               CTTGGAGGGATCCTGTGAAGTATTGTTAGGAGGTGAACTTCA3591                                 (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                   (A) LENGTH: 1097 amino acids                                                  (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        MetMetAspIleTyrValCysLeuLysArgProSerTrpMetValAsp                               44- 40-35 -30                                                                  AsnLysArgMetArgThrAlaSerAsnPheGlnTrpLeuLeuSerThr                               25-20-15                                                                       PheIleLeuLeuTyrLeuMetAsnGlnValAsnSerGlnLysLysGly                               10-51                                                                          AlaProHisAspLeuLysCysValThrAsnAsnLeuGlnValTrpAsn                               101520                                                                         CysSerTrpLysA laProSerGlyThrGlyArgGlyThrAspTyrGlu                              253035                                                                         ValCysIleGluAsnArgSerArgSerCysTyrGlnLeuGluLysThr                               40 4550                                                                        SerIleLysIleProAlaLeuSerHisGlyAspTyrGluIleThrIle                               556065                                                                         AsnSerLeuHisAspPheGlySerSerThrSerLys PheThrLeuAsn                              707580                                                                         GluGlnAsnValSerLeuIleProAspThrProGluIleLeuAsnLeu                               9095100                                                                        SerAlaAspPheSerTh rSerThrLeuTyrLeuLysTrpAsnAspArg                              105110115                                                                      GlySerValPheProHisArgSerAsnValIleTrpGluIleLysVal                               120 125130                                                                     LeuArgLysGluSerMetGluLeuValLysLeuValThrHisAsnThr                               135140145                                                                      ThrLeuAsnGlyLysAspThrLeuHisHisTrpSerTrpA laSerAsp                              150155160                                                                      MetProLeuGluCysAlaIleHisPheValGluIleArgCysTyrIle                               165170175180                                                                   AspAsn LeuHisPheSerGlyLeuGluGluTrpSerAspTrpSerPro                              185190195                                                                      ValLysAsnIleSerTrpIleProAspSerGlnThrLysValPhePro                               200 205210                                                                     GlnAspLysValIleLeuValGlySerAspIleThrPheCysCysVal                               215220225                                                                      SerGlnGluLysValLeuSerAlaLeuIl eGlyHisThrAsnCysPro                              230235240                                                                      LeuIleHisLeuAspGlyGluAsnValAlaIleLysIleArgAsnIle                               245250255 260                                                                  SerValSerAlaSerSerGlyThrAsnValValPheThrThrGluAsp                               265270275                                                                      AsnIlePheGlyThrValIlePheAlaGlyTyrProProAspThrPro                                280285290                                                                     GlnGlnLeuAsnCysGluThrHisAspLeuLysGluIleIleCysSer                               295300305                                                                      TrpAsnProGlyArgVa lThrAlaLeuValGlyProArgAlaThrSer                              310315320                                                                      TyrThrLeuValGluSerPheSerGlyLysTyrValArgLeuLysArg                               33033534 0                                                                     AlaGluAlaProThrAsnGluSerTyrGlnLeuLeuPheGlnMetLeu                               345350355                                                                      ProAsnGlnGluIleTyrAsnPheThrLeuAsnAlaHisAsnProLeu                                360365370                                                                     GlyArgSerGlnSerThrIleLeuValAsnIleThrGluLysValTyr                               375380385                                                                      ProHisThrProThrSerPhe LysValLysAspIleAsnSerThrAla                              390395400                                                                      ValLysLeuSerTrpHisLeuProGlyAsnPheAlaLysIleAsnPhe                               405410415 420                                                                  LeuCysGluIleGluIleLysLysSerAsnSerValGlnGluGlnArg                               425430435                                                                      AsnValThrIleLysGlyValGluAsnSerSerTyrLeuVa lAlaLeu                              440445450                                                                      AspLysLeuAsnProTyrThrLeuTyrThrPheArgIleArgCysSer                               455460465                                                                      ThrGluThr PheTrpLysTrpSerLysTrpSerAsnLysLysGlnHis                              470475480                                                                      LeuThrThrGluAlaSerProSerLysGlyProAspThrTrpArgGlu                               485490 495500                                                                  TrpSerSerAspGlyLysAsnLeuIleIleTyrTrpLysProLeuPro                               505510515                                                                      IleAsnGluAlaAsnGlyLysIleLeuSer TyrAsnValSerCysSer                              520525530                                                                      SerAspGluGluThrGlnSerLeuSerGluIleProAspProGlnHis                               535540545                                                                      LysAlaGluIleArgLeuAspLysAsnAspTyrIleIleSerValVal                               550555560                                                                      AlaLysAsnSerValGlySerSerProProSerLysIleAlaSerMet                               565 570575580                                                                  GluIleProAsnAspAspLeuLysIleGluGlnValValGlyMetGly                               585590595                                                                      LysGlyIleLeuLeuThr TrpHisTyrAspProAsnMetThrCysAsp                              600605610                                                                      TyrValIleLysTrpCysAsnSerSerArgSerGluProCysLeuMet                               615620 625                                                                     AspTrpArgLysValProSerAsnSerThrGluThrValIleGluSer                               630635640                                                                      AspGluPheArgProGlyIleArgTyrAsnPhePheLeuTyrGlyCys                               6 45650655660                                                                  ArgAsnGlnGlyTyrGlnLeuLeuArgSerMetIleGlyTyrIleGlu                               665670675                                                                      GluLeu AlaProIleValAlaProAsnPheThrValGluAspThrSer                              680665690                                                                      AlaAspSerIleLeuValLysTrpGluAspIleProValGluGluLeu                               695 700705                                                                     ArgGlyPheLeuArgGlyTyrLeuPheTyrPheGlyLysGlyGluArg                               710715720                                                                      AspThrSerLysMetArgValLeuGluSerGlyArgSer AspIleLys                              725730735740                                                                   ValLysAsnIleThrAspIleSerGlnLysThrLeuArgIleAlaAsp                               745750 755                                                                     LeuGlnGlyLysThrSerTyrHisLeuValLeuArgAlaTyrThrAsp                               760765770                                                                      GlyGlyValGlyProGluLysSerMetTyrValValThrLysGluAsn                                775780785                                                                     SerValGlyLeuIleIleAlaIleLeuIleProValAlaValAlaVal                               790795800                                                                      IleValGlyValValThrSerIleLeu CysTyrArgLysArgGluTrp                              805810815820                                                                   IleLysGluThrPheTyrProAspIleProAsnProGluAsnCysLys                               825830 835                                                                     AlaLeuGlnPheGlnLysSerValCysGluGlySerSerAlaLeuLys                               840845850                                                                      ThrLeuGluMetAsnProCysThrProAsnASnValGluVal LeuGlu                              855860865                                                                      ThrArgSerAlaPheProLysIleGluAspThrGluIleIleSerPro                               870875880                                                                      ValAlaGluArgProG luAspArgSerAspAlaGluProGluAsnHis                              885890895900                                                                   ValValValSerTyrCysProProIleIleGluGluGluIleProAsn                               905 910915                                                                     ProAlaAlaAspGluAlaGlyGlyThrAlaGlnValIleTyrIleAsp                               920925930                                                                      ValGlnSerMetTyrGlnProGlnAlaLys ProGluGluGluGlnGlu                              935940945                                                                      AsnAspProValGlyGlyAlaGlyTyrLysProGlnMetHisLeuPro                               950955960                                                                      IleAs nSerThrValGluAspIleAlaAlaGluGluAspLeuAspLys                              965970975980                                                                   ThrAlaGlyTyrArgProGlnAlaAsnValAsnThrTrpAsnLeuVal                                985990995                                                                     SerProAspSerProArgSerIleAspSerAsnSerGluIleValSer                               100010051010                                                                   PheGlySerProCysSer IleAsnSerArgGlnPheLeuIleProPro                              101510201025                                                                   LysAspGluAspSerProLysSerAsnGlyGlyGlyTrpSerPheThr                               10301035 1040                                                                  AsnPhePheGlnAsnLysProAsnAsp                                                    10451050                                                                       (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 745 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: CDNA to mRNA                                               (i ii) HYPOTHETICAL: NO                                                        (vii) IMMEDIATE SOURCE:                                                        (B) CLONE: hIgG1Fc                                                             (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 2739                                                             (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        GGTACCGCTAGCGTCGACAGGCCTAGGATATCGATACGTAGAGCCC46                               ValProLeuAla SerThrGlyLeuGlyTyrArgTyrValGluPro                                 151015                                                                         AGATCTTGTGACAAAACTCACACATGCCCACCGTGCCCAGCACCTGAA94                             ArgSerCysAsp LysThrHisThrCysProProCysProAlaProGlu                              202530                                                                         CTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAACCCAAGGAC142                            LeuLeuGlyGly ProSerValPheLeuPheProProLysProLysAsp                              354045                                                                         ACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGAC190                            ThrLeuMetIleSe rArgThrProGluValThrCysValValValAsp                              505560                                                                         GTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGC238                            ValSerHisGluAspProG luValLysPheAsnTrpTyrValAspGly                              657075                                                                         GTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAAC286                            ValGluValHisAsnAlaLysThrLys ProArgGluGluGlnTyrAsn                              80859095                                                                       AGCACGTACCGGGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGG334                            SerThrTyrArgValValSerVal LeuThrValLeuHisGlnAspTrp                              100105110                                                                      CTGAATGGCAAGGACTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCA382                            LeuAsnGlyLysAspTyrLysCy sLysValSerAsnLysAlaLeuPro                              115120125                                                                      GCCCCCATGCAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAA430                            AlaProMetGlnLysThrIleSer LysAlaLysGlyGlnProArgGlu                              130135140                                                                      CCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAAC478                            ProGlnValTyrThrLeuProProSerAr gAspGluLeuThrLysAsn                              145150155                                                                      CAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGGCACATC526                            GlnValSerLeuThrCysLeuValLysGlyPheTyrP roArgHisIle                              160165170175                                                                   GCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACC574                            AlaValGluTrpGluSerAsnGlyGlnProGlu AsnAsnTyrLysThr                              180185190                                                                      ACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAG622                            ThrProProValLeuAspSerAspGlySerPhe PheLeuTyrSerLys                              195200205                                                                      CTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGC670                            LeuThrValAspLysSerArgTrpGlnGlnGlyAs nValPheSerCys                              210215220                                                                      TCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTC718                            SerValMetHisGluAlaLeuHisAsnHisTyrThrGlnL ysSerLeu                              225230235                                                                      TCCCTGTCTCCGGGTAAATGAACTAGT746                                                 SerLeuSerProGlyLys                                                             240245                                                                         (2) INFORMATION FOR SEQ ID NO:8:                                                (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 245 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        ValProLeuAlaSerThrGlyLeuGlyTyrArgTyrValGluProArg                               15 1015                                                                        SerCysAspLysThrHisThrCysProProCysProAlaProGluLeu                               202530                                                                         LeuGlyGlyProSerValPheLeuPheProProLysP roLysAspThr                              354045                                                                         LeuMetIleSerArgThrProGluValThrCysValValValAspVal                               505560                                                                         SerHisGluAsp ProGluValLysPheAsnTrpTyrValAspGlyVal                              65707580                                                                       GluValHisAsnAlaLysThrLysProArgGluGluGlnTyrAsnSer                               85 9095                                                                        ThrTyrArgValValSerValLeuThrValLeuHisGlnAspTrpLeu                               100105110                                                                      AsnGlyLysAspTyrLysCysLysVa lSerAsnLysAlaLeuProAla                              115120125                                                                      ProMetGlnLysThrIleSerLysAlaLysGlyGlnProArgGluPro                               130135140                                                                       GlnValTyrThrLeuProProSerArgAspGluLeuThrLysAsnGln                              145150155160                                                                   ValSerLeuThrCysLeuValLysGlyPheTyrProArgHisIleAla                                165170175                                                                     ValGluTrpGluSerAsnGlyGlnProGluAsnAsnTyrLysThrThr                               180185190                                                                      ProProValLeuAsp SerAspGlySerPhePheLeuTyrSerLysLeu                              195200205                                                                      ThrValAspLysSerArgTrpGlnGlnGlyAsnValPheSerCysSer                               210215 220                                                                     ValMetHisGluAlaLeuHisAsnHisTyrThrGlnLysSerLeuSer                               225230235240                                                                   LeuSerProGlyLys                                                                245                                                                        

We claim:
 1. A substantially homogeneous purified LIF receptor (LIF-R) protein comprising an amino acid sequence selected from the group consisting of amino acids 1-957 of SEQ ID NO:2, amino acids 1-45 of SEQ ID NO: 2, amino acids 1-76 of SEQ ID NO:4, and amino acids 1-1053 of SEQ ID NO:6.
 2. A substantially homogeneous purified LIF receptor (LIF-R) protein capable of binding LIF, wherein said LIF-R in encoded by a DNA selected from the group consisting of:(a) a DNA comprising the coding region of the DNA sequence of SEQ ID NO: 1; (b) A DNA comprising the coding region of the DNA sequence of SEQ ID NO: 3; (c) a DNA comprising the coding region of the DNA sequence of SEQ ID NO: 5; (d) a DNA that hybridize to the complement of a DNA of (a), (b) or (c) under moderately stringent conditions and which encodes a LFF-R capable of binding LIF; and (e) A DNA that is degenerate as a result of the genetic codd to a DNA of (a), (b), (c), or (d).
 3. A purified LIF-R protein according to claim 2, wherein said LIF-R protein is a human LIF-R protein comprising an amino acid sequence selected from the group consisting of amino acids 1-957 of SEQ ID NO: 1, amino acids 1-945 of SEQ ID NO: 1, and amino acids 1-1053 of SEQ ID NO: 5, except for modification(s) selected from the group consisting of:a) inactivation of N-glycosylation site(s); b) inactivation of KEX2 protease processing site(s); c) addition of an N-terminal methionine residue; and d) conservative amino acid substitution(s).
 4. A homodimeric receptor comprising two fusion proteins joined via disulfide bonds, wherein each fusion protein comprises a human LIF-R polypeptide fused to an antibody Fc polypeptide, wherein said LIF-R polypeptide comprises an amino acid sequence extending from amino acids x to y of SEQ ID NO:1, wherein x is 1 to 11 and y is 479 to 789, wherein said fusion protein may additionally comprise a peptide linker between said LIF-R polypeptide and said Fc polypeptide.
 5. A homodimeric receptor according to claim 4, wherein x is 1 and y is selected from the group consisting of 702, 775, and
 789. 6. A composition comprising a LIF receptor (LIF-R) protein according to claim 4 and a suitable diluent or carrier. 